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POLYNUCLEOTIDES AND POLYPEPTIDES ASSOCIATED WITH 
ANTIBIOTIC BIOSYNTHESIS AND USES THEREFOR 

FIELD OF THE INVENTION 

THIS INVENTION relates generally to antibiotic biosynthesis. More particularly, 
5 the present invention relates to polyketides and the polyketide synthases and ancillary 
enzymes that are capable of producing such compounds. Even more particularly, the 
present invention relates to a polyketide synthase linked to a non-ribosomal peptide 
synthetase involved in the biosynthesis of albicidins, to a phosphopantetheinyl transferase 
for activating enzymes, particularly polyketide syntliases and/or non-ribosomal peptide 

10 synthetases, associated with the biosynthesis of albicidins, and to a methyltransferase for 
methylating precursors of albicidins and/or intermediates related to albicidin biosynthesis. 
The present invention also relates to biologically active fragments of the aforementioned 
polypeptides and to variants and derivatives of these molecules. Further, the invention 
relates to polynucleotides encoding the said polypeptides, including the xabA, xabB and 

15 xabC genes oi Xanthomonas albilineans, to polynucleotides encoding the said fragments, 
variants or derivatives, to vectors comprising the said polynucleotides and to host cells 
containing such vectors. The invention also relates to a transcriptional control element for 
modulating the expression of polynucleotides including, for example, itiQxabB gene and/or 
thQ xabC gene oi Xanthomonas albilineans, or variants thereof The invention also features 

20 metliods of using the polynucleotides, polypeptides, fragments, variants, derivatives and 
vectors for activating polyketide synthases and/or non-ribosomal peptide synthetases, for 
methylating precursors of albicidins or their analogues and/or intermediates involved in the 
biosynthesis of albicidins or their analogues and for enhancing the level and/or functional 
activity of albicidins or their analogues. The invention also encompasses methods of using 

25 the aforesaid polynucleotides, polypeptides, fragments, variants and derivatives for the 
biosynthesis of albicidins or analogues thereof 

Bibliographic details of various pubUcations referred to by author in this 
specification are collected at the end of the description. 
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BACKGROUND OF THE INVENTION 

Polyketides represent a large structurally diverse group of compounds synthesised 
from 2-carbon units through a series of condensations and subsequent modifications. They 
possess a broad range of biological activities including antibiotic and pharmacological 
5 properties. For example, polyketides are represented by antibiotics such as tetracyclines, 
erythromycins, immunosuppressants such as FK506, FK520 and rapamycin. anticancer 
agents such as daunomycin and veterinary products such as monensin and avermectin. 

Considering tlie difficulty in producing polyketide compounds by conventional 
chemical methodologies, and the typically low production of polyketides in wild-type 

10 cells, there has been considerable interest in finding improved or alternate means to 
produce polyketide compounds. In this regard, reference maybe made to PCX publication 
Nos. WO 93/13663; WO 95/08548; WO 96/40968; WO 97/02358; and WO 98/27203; 
U.S. Pat. Nos. 4.874.748; 5,063.155; 5,098,837; 5.149.639; 5.672.491; and 5.712,146; Fu 
et al. (1994, Biochetnistry 33: 9321-9326); McDaniel et al. (1993, Science 262: 1546- 

15 1550); and Rohr (1995. Angew. Chem. Int. Ed. Engl. 34(8): 881-888). 

Polyketides are synthesised in nature by polyketide synthases (PKS). These 
enzymes, which are actually complexes of multiple enzyme activities, are in some ways 
similar to. but in other ways different from, the synthases that catalyse condensation of 2- 
carbon units in the biosynthesis of fatty acids. Specifically. PKS enzymes catalyse the 
90 biosynthesis of polyketides through repeated (decarboxylative) Claisen condensations 
between acylthioesters {e.g., acetyl, propionyl, malonyl or methylmalonyl). Following each 
condensation, they introduce structural variability into the product by catalysing all. part, 
or none of a reductive cycle comprising a ketoreduction, dehydration, and enoyhreduction 
on the ^-keto group of the growing polyketide chain. PKS enzymes incorporate enormous 
25 structural diversity into their products, in addition to varying the condensation cycle, by 
controlling choice of primer, extender units, and the overall chain length and, particularly 
in the case of aromatic polyketides, regiospecific cyclisation of tiie nascent polyketide 
chain. After the carbon chain has grown to a length characteristic of each specific product, 
it is released from the synthase by tiiiolysis or acyltransfer. Thus, the PKS complexes 
30 consist of families of enzymes which work together to produce a given polyketide. It is the 
choice of chain-building units, controlled variation in chain length, and the reductive cycle, 
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genetically programmed into each PKS, that contributes to the variation seen among 
naturally occurring polyketides. 

Two major types of PKS enz>Tnes are known; these differ in tlieir composition 
and mode of synthesis of the polyketide synthesised. These two major types of PKS 
5 enzymes are commonly referred to as Type I or "modular" and Type n "iterative" PKS 
enzymes. These classifications are well known and reference maybe made, for example, to 
Hopvvood and Khosla (1992). 

The Type I or modular PKS enzymes typically catalyse the biosynthesis of 
complex polyketides such as erythromycin and avermectin. These modular enzymes 

10 include assemblies of several large multifunctional proteins carrying, between them, a set 
of separate active sites for each step of carbon chain assembly and modification (Cortes et 
al, 1990; Donadio et al, 1991; MacNeil et al, 1992). Accordingly, modular PKS 
complexes can be viewed as biochemical assembly lines, composed of a series of catalytic 
domains involved in sequential assembly and modification of acyl groups on the growing 

15 polyketide chain (Cane et al, 1998; Keating and Walsh, 1999). The catalytic domams are 
arranged in "modules", punctuated by acyl carrier protein (AC?) domains that tether the 
nascent polyketide while it undergoes the catalytic modifications progranmied in the 
associated module. For each polyketide there is an initiation module, a series of elongation 
modules that define the length and structure of the polyketide chain, and a termination 

20 module to release the product from the final tether. The initiation module typically 
comprises an acyl transferase (AT) domam tiiat couples the initial acyl group from an acyl- 
CoA substrate to the phosphopantetheinyl tether of the first ACP domain. Each elongation 
module typically comprises a ketosynthase (KS), an AT and an ACP. The KS removes the 
growing polyketide unit from the upstream ACP and couples it to tiie next acyl group in 

25 the chain, which has aheady been selected and loaded by the AT onto the ACP in the same 
module. Other catalytic domains (eg. a ketoacyl reductase (KR), and dehydratase (DH)) 
within an elongation module can modify the newly elongated polyketide before it is 
transferred to flie next module in the biochemical assembly line. A thioesterase (TE) 
domam in the termination module accompUshes release of the assembled polyketide from 

30 the last ACP in the series (Cane et al, 1998; Keating and Walsh, 1999). 
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Biosynthesis of a polyketide can involve the sequential action of several PKS 
proteins, each with one to six elongation modules (MacNeil et al, 1992; Apricio et al, 
1996). There are variations on the modular PKS design, mcluding participation by some 
loading domains across modules or in trans from separate proteins (Keating and Walsh, 
5 1999), and several examples of hybrid PKS/NRPS proteins (Albertini et al, 1995; Gehring 
et al, 1998; Duitman et al, 1999; Paitan et al, 1999). Subsequent modification of the 
polyketide by dedicated tailoring enzymes is generally required to complete the 
biologically active product (Hopwood, 1997). Other biologically active compounds 
including antibiotics comprise polypeptides assembled by non-ribosomal peptide 

10 synthetases (NRPSs). NRPSs typically show a modular architecture and tethered 
biosynthetic strategy analogous to PKSs (Cane et al, 1998; Keating and Walsh, 1999). hi 
NRPSs a condensation (C) domain removes the growing peptide unit from the upstream 
PCP domam and couples it to the next amino acid group in the chain, which has akeady 
been selected and loaded by an adenylation (A) domain onto the PCP in the same module 

15 (Marahiel et al, 1997; Stachelhaus et al, 1998). Other catalytic domains (e.g., epimerase 
or N-methytransferase) within an elongation module can modify the newly elongated 
polypeptide before it is transferred to the next module in the biochemical assembly line 
(Marahiel etal, 1997). 

Many phytopathogenic bacteria and fungi secrete toxms with phytotoxic activity 
20 and a broad spectrum of antimicrobial properties (Guenzi et al, 1998). Albicidin 
phytotoxins are polyketides produced by Xanthomonas albilineans, which are key 
pathogenicity factors in the development of leaf scald, one of the most devastating diseases of 
sugarcane {Saccharum, mterspecific hybrids) (Ricaud and Ryan, 1989; Zhang and Birch, 
1997; Zhang et al, 1999). Albicidins selectively block prokaryote DNA repUcation and cause 
25 the characteristic chlorotic symptoms of leaf scald disease by blocking chloroplast 
development (Birch and Patil, 1983; 1985b; 1987a; 1987b). Because albicidins are rapidly 
bactericidal at nanomolar concentrations against a broad range of Gram-positive and 
Gram-negative bacteria, they are also of interest as potential clinical antibiotics (Birch and 
Patil, 1985a). 

30 The major antimicrobial component of the family of albicidins produced in 

culture by X. albilineans has been partially characterised as a low Mr compound with 
several aromatic rings (Birch and Patil, 1985a). Low yields have slowed studies into the 
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chemical structure of albicidin, its appUcation as a tool to study prokaryote DNA 
replication, and its development as a clinical antibiotic (Zhang et al, 1998). Genetic 
analysis of albicidin biosynthesis is likely to indicate approaches to increase yields, 
probable structural features, and opportunities for engineering novel antibiotics in this 
5 family. 
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SUMMARY OF THE INVENTION 

The present invention arises in part from the identification and characterisation of 
several X. albilineans genes associated with albicidin biosynthesis. In particular, the 
present inventor has isolated a novel X, albilineans gene {xabE), which encodes a large 
5 protein (predicted Mr 525,695), with a modular architecture indicative of a multifiirictional 
PKS linked to a non-ribosomal peptide synthetase (NRPS). At 4801 amino acids in length, 
the product of xabB pCabB) is the largest reported PKS-NRPS. Twelve catalytic domains 
in this multifunctional enzyme are arranged in the order N-terminus-acyl-Co A Ugase (AL)- 
acyl carrier protein (ACP)-)3-ketoacyl synthase (KS)-i3-ketoacyl reductase (KR)-ACP- 

1 0 ACP-KS-peptidyl carrier protein (PCP)-condensation domain (C)-adenylation domain (A)- 
PCP-C. The modular architecture of XabB indicates hkely steps in albicidin biosynthesis, 
and approaches to enhance antibiotic yield. The novel pattern of domains, in comparison 
with known PKS-NRPS enzymes for antibiotic production, also contributes to the 
knowledge base for rational design of enzymes producing novel antibiotics. The present 

15 inventor has found that XabB is required for the production of albicidins and that enhanced 
expression of xabB leads to increased levels and/or functional activities of albicidin 
antibiotics. 

A gene (xabC) encoding a novel 0-methyltransferase has also been isolated, 
which methylates albicidin precursors and/or intermediates involved in albicidin 
20 biosynthesis. Surprisingly, enhanced expression of xabC has been found to increase the 
levels and/or functional activities of albicidin antibiotics. 

The present inventor has also isolated a gene {xabA) encoding a 
phosphopantetheinyl transferase (PPTase), which is required for post-translational 
activation of synthetases in the albicidin biosynthetic pathway. In this regard, it is known 

25 that inefficient phosphopantetheinylation has limited the activity of other antibiotic 
synthetases overexpressed in heterologous species (Walsh et al, 1997). Accordingly, the 
isolated xabA gene, together with its target in the albicidin biosynthetic pathway (e.g., 
xabB), provide the means to engineer high level co-expression of the albicidin synthetase 
and its activating PPTase to obtain albicidins in higher yields, and ultimately to manipulate 

30 the elements of the albicidin biosyntlietic machinery, by mutagenesis or by other means, to 
produce desired structural variants of this novel antibiotic class. 
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The above genes, in whole or in part, together with their variants and derivatives, 
are useful inter alia for modulating the level and/or functional activity of albicidins, for 
expressing PKS enzymes in recombinant host cells, for producmg polyketides including 
■ albicidins and their analogues and for combinatorial biosynthesis, as described hereinafter. 

5 Accordingly, one aspect of the present invention contemplates an isolated 

polypeptide encoding at least a portion of an albicidin PKS-NRPS (XabB) or its variants or 
derivatives. In one embodiment of this type, the invention provides an isolated polypeptide 
comprising at least one domain selected from the group consisting of: 

(a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or 
10 more of SEQ ID NO: 6 and 8, or variants thereof. 

(b) a j3-ketoacyl synthase (KS) domain comprising a sequence set forth in any one or 
more of SEQ E) NO: 10, 12, 14, 16, 18 and 20, or variants thereof; 

(c) a iS-ketoacyl reductase (KR) domain comprising the sequence set forth SEQ ID 
NO: 22, or variants thereof; 

15 (d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one 

or more of SEQ ED NO: 24, 26 and 28, or variants thereof; 

(e) an adenylation (A) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof; 

(f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any 
20 one or more of SEQ ID NO: 50 and 52, or variants thereof; and 

(g) a condensation (C) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants 
thereof. 

Preferably, the AL domain comprises each of the sequences set forth in SEQ ID 
25 NO: 6 and 8, or variants thereof. 

In one embodiment, the KS domain preferably comprises each of the sequences 
set forth in SEQ ID NO: 10, 12 and 14, or variants thereof. In an alternate embodiment, the 
KS domain preferably comprises each of the sequences set forth in SEQ ID NO: 16, 18 and 
20, or variants thereof 



30 Preferably, the A domain comprises each of the sequences set forth in SEQ ID 

NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof 
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In one embodiment, the C domain preferably comprises each of the sequences set 
forth in SEQ ID NO: 54, 56, 58, 60, 62, 64 and 66, or variants thereof In an alternate 
embodiment, the C domain preferably comprises each of the sequences set forth in SEQ ID 
NO: 68, 70, 72, 74, 76, 78 and 80, or variants thereof 

5 In another embodiment, the invention provides an isolated polypeptide comprising 

at least a biologically active fragment or portion of the sequence set forth in SEQ ID NO: 
2, or a variant or derivative thereof 

Suitably, the biologically active fragment is at least 6 amino acids in length. 

In a preferred embodiment, the domains broadly described above are arranged in 
10 an N- to C-terminal direction as follows: AL-ACP-KS-KR-ACP-ACP-KS-PCP-C-A-PCP- 
C. 

Suitably, the biologically active fragment comprises at least one domain selected 
from tlie group consisting of the AL domain, the KS domain, the KR domain, the ACP 
domain, the A domain, the PCP domain and the C domain as broadly described above. 

15 Suitably, the variant has at least 60%, preferably at least 70%, more preferably at 

least 80%, more preferably at least 90% and still more preferably at least 95% sequence 
identity to the sequence set forth in SEQ ID NO: 2. 

Preferably, the variant comprises at least one sequence selected from the group 
consisting of SEQ ID NO: 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 

20 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variant 
thereof In this regard, the variant preferably has at least 70%, preferably at least 80%, 
more preferably at least 90%, and still more preferably at least 95% sequence identity to 
any one of the amino acid sequences set forth in SEQ ID NO: 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 

25 70, 72, 74, 76, 78 and 80. 

In another aspect, the present invention contemplates an isolated polypeptide 
encoding at least a portion of a PPTase (XabA) associated with albicidin biosynthesis or its 
variants or derivatives. In one embodiment of this type, the invention provides an isolated 
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polypeptide comprising at least biologically active fragment or portion of the sequence set 
forth in SEQ ED NO: 83, or a variant or derivative thereof. 

Suitably, the biologically active fragment comprises at least one, and preferably 
both, of the consensus PPTase sequence motifs set forth in SEQ ID NO: 89 and 93, or 
5 variant thereof. Preferably, the biologically active fragment comprises the intervening 
sequence between the said consensus PPTase sequence motifs, which mtervening sequence 
comprises the sequence set forth in SEQ ID NO: 91, or variant thereof 

Preferably, the biologically active fragment comprises a contiguous sequence of 
amino acids contained within the sequence set forth in SEQ ID NO; 87, or variant thereof 

10 Suitably, the variant has at least 60%, preferably at least 70%, more preferably at 

least 80%, more preferably at least 90% and still more preferably at least 95% sequence 
identity to the sequence set forth in SEQ ID NO: 83. 

Preferably, the variant comprises at least one sequence selected from the group 
consisting of SEQ ID NO: 87, 89, 91 and 93, or variant thereof In this regard, the variant 
15 preferably has at least 70%, preferably at least 80%, more preferably at least 90%, and still 
more preferably at least 95% sequence identity to any one of the amino acid sequences set 
forth in SEQ ID NO: 87, 89, 91 or 93. 

In yet another aspect, the present invention contemplates an isolated polypeptide 
encoding at least a portion of a methyltransferase (XabC) associated v^th albicidin 
20 biosynthesis or its variants or derivatives. In one embodiment of this type, the invention 
provides an isolated polypeptide comprising at least biologically active fragment or portion 
of the sequence set forth in SEQ ID NO: 95, or a variant or derivative thereof 

Suitably, the biologically active fragment comprises at least one, and preferably 
all, of the consensus methyltransferase sequence motifs set forth in SEQ ID NO: 99, 101 
25 and 1 03 , or variant thereof 

Preferably, the biologically active fragment comprises a contiguous sequence of 
amino acids contamed within the sequence set forth in SEQ ID NO: 105, or variant thereof 
In a preferred embodiment, the biologically active fragment comprises a contiguous 
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sequence of amino acids contained within the sequence set forth in SEQ ID NO: 107, or 
variant thereof. 

Suitably, the variant has at least 60%, preferably at least 70%, more preferably at 
least 80%, more preferably at least 90% and still more preferably at least 95% sequence 
5 identity to the sequence set forth in SEQ ID NO: 95. 

Preferably, the variant has at least 70%, preferably at least 80%, more preferably 
at least 90%, and still more preferably at least 95% sequence identity to any one of the 
amino acid sequences set forth in SEQ ID NO: 99, 101 and 103. 

In still yet another aspect, the invention contemplates an isolated polynucleotide 
10 encoding at least a portion of an albicidin PKS-NRPS (XabB) or its variants or derivatives, 
as broadly described above. Preferably, the polynucleotide comprises the sequence set 
forth in any one of SEQ ID NO: 1 and 3, or a biologically active fragment thereof, or a 
polynucleotide variant of these. 

Suitably, the biologically active fragment is at least 18 nucleotides in length. 

15 The polynucleotide preferably encodes at least one domain selected from the 

group consisting of the AL domain, the KS domain, the KR domain, the ACP domain, the 
A domain, the PCP domain and the C domain as broadly described above- 
Suitably, the AL domain is encoded by a nucleotide sequence set forth in any one 
or more of SEQ ED NO: 5 and 7, or variants thereof Preferably, the AL domain is encoded 

20 by a nucleotide sequence comprising each of the sequences set forth in SEQ ID NO: 5 and 
7, or variants thereof 

The KS domain is preferably encoded by a nucleotide sequence set forth in any 
one or more of SEQ ED NO: 9, 11, 13, 15, 17 and 19, or variants thereof In one 
embodiment, the KS domain is preferably encoded by a nucleotide sequence comprising 
25 each of the sequences set forth in SEQ ID NO: 9, 11 and 13, or variants thereof In an 
altemate embodiment, the KS domain is preferably encoded by a nucleotide sequence 
comprising each of the sequences set forth in SEQ ID NO: 15, 17 and 19, or variants 
thereof. 
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Preferably, the KR domain is encoded by a nucleotide sequence set forth in SEQ 
ID NO: 2 1 , or variant thereof. 

Suitably, the ACP domain is encoded by a nucleotide sequence set forth in any 
one or more of SEQ ID NO: 23, 25 and 27, or variants thereof 

5 The A domain is preferably encoded by a nucleotide sequence set forth in any one 

or more of SEQ ID NO: 29, 31, 33, 35, 37, 39, 41, 43, 45 and 47, or variants thereof In a 
preferred embodiment, the A domain is encoded by a nucleotide sequence comprising each 
of the sequences set forth in SEQ ID NO: 29, 31, 33, 35, 37, 39, 41, 43, 45 and 47, or 
variants thereof 

10 Suitably, the PCP domain is encoded by a nucleotide sequence set forth in any 

one or more of SEQ ID NO: 49 and 5 1 , or variants thereof 

Preferably, the C domain is encoded by a nucleotide sequence set forth in any one 
or more of SEQ ID NO: 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77 and 79, or variants 
thereof In one embodiment, the C domain is preferably encoded by a nucleotide sequence 
15 comprising each of the sequences set forth in SEQ ID NO: 53, 55, 57, 59, 61, 63 and 65, or 
variants thereof In an alternate embodiment, the C domain is preferably encoded by a 
nucleotide sequence comprising each of the sequences set forth in SEQ ED NO: 67, 69, 71, 
73, 75, 77 and 79, or variants thereof 

In one embodiment, the polynucleotide variant has at least 60%, preferably at 
20 least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the polynucleotides set forth in SEQ ID NO: 1 or 3. 

In another embodiment, the polynucleotide variant is capable of hybridising to 
any one of the polynucleotides identified by SEQ ID NO: 1 or 3 under at least low 
stringency conditions, preferably under at least medium stringency conditions, and more 
25 preferably under higji stringency conditions. 

Preferably, the polynucleotide variant comprises a nucleotide sequence encoding 
at least one domain selected from the group consisting of the AL domain, the KS domam, 
the KR domain, the ACP domain, the A domain, the PCP domain and the C domain as 
broadly described above. 
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In one embodiment, the nucleotide sequence variant has at least 60%, preferably 
at least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the sequences set forth in SEQ ED NO: 5, 7, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 
5 71, 73, 75, 77 and 79. 

In another embodiment, the nucleotide sequence variant is capable of hybridising 
to any one of the sequences identified by SEQ ID NO: 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 
25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 
73, 75, 77 and 79 under at least low stringency conditions, preferably under at least 
10 medium stringency conditions, and more preferably under high stringency conditions. 

In a further aspect, the invention contemplates an isolated polynucleotide 
encodmg at least a portion of a PPTase (XabA) associated with albicidin biosynthesis or its 
variants or derivatives, an isolated polynucleotide encoding a polypeptide, firagment, 
variant or derivative as broadly described above. Preferably, the polynucleotide comprises 
15 the sequence set forth in any one of SEQ ID NO: 82 and 84, or a biologically active 
fragment thereof, or a polynucleotide variant of these. 

Alternatively, the polynucleotide comprises a contiguous sequence of nucleotides 
contained within the sequence set forth in SEQ ID NO: 86, or variant thereof. 

In one embodiment, the polynucleotide variant has at least 60%, preferably at 
20 least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the polynucleotides set forth in SEQ ID NO: 82, 84 and 86. 

In another embodiment, the polynucleotide variant is capable of hybridising to 
any one of the polynucleotides identified by SEQ JD NO: 82, 84 and 86 under at least low 
stringency conditions, preferably under at least medium stringency conditions, and more 
2 5 preferably under high stringency conditions. 

Preferably, the polynucleotide variant comprises a nucleotide sequence encoding 
at least one PPTase sequence motif selected firom SEQ ID NO: 89 and 93, or variant 
thereof. Suitably, the polynucleotide variant comprises a nucleotide sequence encoding the 
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intervening sequence between the said consensus PPTase sequence motifs, said nucleotide 
sequence comprising the sequence set forth in SEQ ID NO: 91. 

The polynucleotide variant suitably comprises a nucleotide sequence encoding a 
contiguous sequence of amino acids contained within the sequence set forth in SEQ ID 
5 NO: 87, or variant thereof In this instance, the contiguous sequence is preferably encoded 
by the sequence set forth in SEQ ID NO: 86, or nucleotide sequence variant thereof 

Suitably, the PPTase sequence motif is encoded by a nucleotide sequence 
comprising the sequence set forth in any one of SEQ ID NO: 88 and 92, or nucleotide 
sequence variant thereof. 

10 Preferably, the said intervening sequence is encoded by the nucleotide sequence 

set forth in SEQ JD NO: 90, or nucleotide sequence variant thereof 

In one embodiment, the nucleotide sequence variant has at least 60%, preferably 
at least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the sequences set forth in SEQ ED NO: 86, 88, 90 and 92. 

15 In another embodiment, the nucleotide sequence variant is capable of hybridising 

to any one of the sequences identified by SEQ ED NO: 86, 88, 90 and 92 under at least low 
stringency conditions, preferably under at least medium stringency conditions, and more 
preferably under high stringency conditions. 

In yet a further aspect, the invention contemplates an isolated polynucleotide 
20 encoding at least a portion of a methyltransferase pCabC) associated with albicidin 
biosynthesis or its variants or derivatives. Preferably, the polynucleotide comprises the 
sequence set forth in any one of SEQ ID NO: 94 and 96, or a biologically active firagment 
thereof, or a polynucleotide variant of these. 

Alternatively the polynucleotide comprises a contiguous sequence of nucleotides 
25 contained wdthin the sequence set forth in SEQ ID NO: 104, or variant thereof In one 
embodiment, this polynucleotide preferably comprises a contiguous sequence of 
nucleotides contained within the sequence set forth in SEQ ID NO: 106, or variant thereof 
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In one embodiment, the polynucleotide variant has at least 60%, preferably at 
least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the polynucleotides set forth in SEQ ID NO: 94, 96, 104 and 106. 

In another embodiment, the polynucleotide variant is capable of hybridising to 
5 any one of the polynucleotides identified by SEQ ID NO: 94, 96, 104 and 106 under at 
least low stringency conditions, preferably under at least medium stringency conditions, 
and more preferably under high stringency conditions. 

Preferably, the polynucleotide variant comprises a nucleotide sequence encoding a 
methyltransferase sequence motif selected j&om any one or more of SEQ ID NO: 99, 101 
- 10 and 103, or variant thereof 

Suitably, the methyltransferase sequence motif is encoded by a nucleotide 
sequence comprising the sequence set forth in any one of SEQ ED NO; 98, 100 and 102, or 
nucleotide sequence variant thereof 

In one embodiment, the nucleotide sequence variant has at least 60%, preferably 
15 at least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the sequences set forth in SEQ ID NO: 98, 100 and 102. 

In another embodiment, the nucleotide sequence variant is capable of hybridising 
to any one of the sequences identified by SEQ ID NO: 98, 100 and 102 under at least low 
stringency conditions, preferably under at least medium stringency conditions, and more 
20 preferably under high stringency conditions. 

In still a fiirther aspect, the invention features an expression vector comprising a 
polynucleotide as broadly described above wherein the polynucleotide is operably linked 
to a regulatory polynucleotide. 

In another aspect, the invention provides a host cell containing a said expression 

25 vector. 

Suitably, the host cell is a bacterium or other prokaryote. 

In yet another aspect, the invention is directed to a multiplicity of cell colonies, 
constituting a library of colonies, wherein each colony of the Ubrary contains an expression 
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vector for the production of a polypeptide, fragment, variant or derivative as broadly 
described above. 

The invention also features a method of producing a recombinant polypeptide, 
fragment, variant or derivative as broadly described above, comprising: 
5 - culturing a host cell containing an expression vector as broadly described 

above such that said recombinant polypeptide, fragment, variant or derivative is 
expressed from said polynucleotide; and 

- isolating the said recombinant polypeptide, fragment, variant or derivative. 

In another aspect, the invention provides a method of producing a biologically 
10 active fragment of a polypeptide as broadly described above, comprismg: 

- detecting an activity associated with a fragment of the polypeptide set forth in 
SEQ ID NO: 2, wherein said activity is selected form the group consisting of acyl-CoA 
ligase activity, /3-ketoacyl synthase activity, jS-ketoacyl reductase, acyl carrier protein 
activity, adenylation activity, peptidyl carrier protein activity and condensation activity; 

15 or 

- detecting PPTase activity associated with a fragment of the polypeptide set 
forth in SEQ ID NO: 83; or 

- detectmg methyltransferase activity associated with a fragment of the 
polypeptide set forth m SEQ ID NO: 95; 

20 wherein detection of said activity is indicative of said fragment being a biologically 

active fragment. 

In a fiirther aspect, the invention provides a method of producing a biologically 
active fragment as broadly described above, comprising: 

- introducing a polynucleotide from which a fragment of a polypeptide as 
25 broadly described above can be produced into a cell; and 

- detecting an activity selected form the group consisting of acyl-CoA ligase 
activity, jS-ketoacyl synthase activity, /3-ketoacyl reductase, acyl carrier protein activity, 
adenylation activity, peptidyl carrier protein activity and condensation activity; or 

- detecting PPTase activity associated with a fragment of the polypeptide set 
30 forth in SEQ ID NO: 83; or 
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- detecting methyltransferase activity associated with a fragment of the 
polypeptide set forth in SEQ ID NO: 95; 

wherein detection of said activity is indicative of said fragment being a biologically 

active fragment. 

5 In yet a further aspect, the invention provides a method of producing a variant of a 

polypeptide as broadly described above (parent polypeptide), or a biologically active 
fragment thereof, comprising: 

- producing a modified polypeptide whose sequence is distinguished from the 
parent polypeptide or the biologically acHve fragment by substitution, deletion or 

1 0 addition of at least one amino acid; and 

_ detecting an.activity associated with the modified polypeptide, wherein said 

activity is selected form the group consisting of acyl-CoA Ugase activity, ^-ketoacyl 
synthase activity, ^-ketoacyl reductase, acyl carrier protein activity, adenylation 
activity, peptidyl carrier protein activity, condensation activity, PPTase activity and 
15 methyltransferase activity, wherem detection of said activity is indicative of said 
modified polypeptide being a variant. 

In a further aspect, the invention contemplates a method of producing a variant of 
a parent polypeptide as broadly described above, or biologically active fragment thereof, 
comprising: 

20 - producing a polynucleotide from which a modified polypeptide as described 

above can be produced; 

- introducing said polynucleotide into a cell; and 

- detecting an activity associated with the modified polypeptide, wherem said 
activity is selected form the group consisting of acyl-CoA ligase activity, 0-ketoacyl 

25 synthase activity, /3-ketoacyl reductase, acyl carrier protein activity, adenylation 
activity, peptidyl carrier protein activity, condensation activity, PPTase activity and 
methyltransferase activity, wherein detection of said activity is indicative of said 
modified polypeptide being a variant.. 

In yet another aspect, the invention extends to a method of screening for an agent 
30 that modulates the expression of a gene or variant thereof or the level and/or functional 
activity of an expression product of said gene or variant thereof, wherein said gene is 
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selected from xabB, xabA, or xabC, or a gene belonging to the same regulatory or 
biosynthetic pathway as xabB, xabA, orxabC, said method comprising: 

- contacting a preparation comprising a polypeptide encoded by said gene, or 
biologically active fragment of said polypeptide, or variant or derivative of these, or a 

5 genetic sequence (e.g., a transcriptional control element) that modulates the expression 
of said gene or variant thereof, with a test agent; and 

- detecting a change in the level and/or functional activity of said polypeptide or 
biologically active fragment thereof, or variant or derivative, or of a product expressed 
from said genetic sequence. 

10 The transcriptional control element preferably comprises the sequence set forth in 

SEQ ID NO : 81 or complement thereof 

The invention, in another aspect, also provides a method for enhancing the level 
and/or functional activity of an albicidin, said method comprising: 

- introducing into an albicidin-producing host cell (1) an agent that modulates 
15 the expression of a gene encoding at least a portion of an albicidin PKS-NRPS or 

variant or derivative thereof, or the level and/or functional activity of an expression 
product of said gene, or (2) a vector from which a polynucleotide encoding at least a 
portion of an albicidin PKS-NRPS or variant or derivative thereof can be translated; 

- and culturing the host cell for a time and imder conditions sufficient to 
20 enhance the level and/or functional activity of said albicidin. 

Preferably, the method further comprises introducing into said host cell a vector 
from which a PPTase can be translated. Suitably, the PPTase is selected from EntD or 
XabA. 

Preferably, the method further comprises introducing into said host cell a vector 
25 from which a methyltransferase, more preferably and .O-methyltransferase, and even more 
preferably an iS-adenosylmethionine 0-methyltransferase can be translated. 

Accordmg to another aspect of the invention, there is provided a method for 
enhancing the level and/or functional activity of an albicidm, said method comprising 
contacting a precursor of said albicidin or an intermediate involved in the biosynthesis of 
30 said albicidm with at least a portion of an albicidin PKS-NRPS, or variant or derivative 
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thereof, as broadly described above, for a time and under conditions sufficient to enhance 
the level and/or functional activity of said albicidin. 

Preferably, the method fiirther comprises contacting a precursor of said albicidin 
or an intermediate involved in the biosynthesis of said albicidin with a PPTase. 

5 Preferably, the method further comprises contacting a precursor of said albicidin 

or an intermediate involved in the biosynthesis of said albicidin with a methyltransferase, 
more preferably and 0-methyltransferase, and even more preferably an S- 
adenosylmethionine 0-methyltransferase. 

In another aspect, the invention provides a method of identifying a PPTase for 
10 enliancing the level and/or functional activity of an albicidin, said method comprising 
introducing into an albicidin-deficient strain ofX, albilineans which lacks xabA a vector 
comprising a polynucleotide encoding a test PPTase, wherem said polynucleotide is 
operably linked to a regulatory polynucleotide, and detecting production of albicidin. 

Suitably, the strain is LSI 56 described herein. 

1 5 Preferably, the PPTase is EntD. 

The invention, in another aspect, also provides a method for enhanciag the level 
and/or functional activity of an albicidin, said method comprising: 

- mtroducing into an albicidin-producing host cell (1) an agent that modulates 
the expression of a gene encoding at least a portion of a PPTase associated with 

20 albicidin biosynthesis or variant or derivative thereof, or the level and/or functional 

activity of an expression product of said gene, or (2) a vector from which a 
polynucleotide encoding at least a portion of a PPTase associated with albicidin 
biosynthesis or variant or derivative thereof can be translated; 

- and culturtng the host cell for a time and under conditions sufficient to 
25 enhance the level and/or functional activity of said albicidin 

In yet another aspect, the invention provides a method for enhancing the level 
and/or functional activity of an albicidin, said method comprising: 

- introducmg mto an albicidin-producing host cell (1) an agent that modulates 
the expression of a gene encoding at least a portion of a methyltransferase associated 
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with albicidin biosynthesis or variant or derivative thereof, or the level and/or 
functional activity of an expression product of said gene, or (2) a vector from which 
a polynucleotide encoding at least a portion of a methyltransferase associated with 
albicidin biosynthesis or variant or derivative thereof can be translated; 
5 - and culturing the host cell for a time and under conditions sufficient to 

enhance the level and/or fiinctional activity of said albicidin 

In another aspect, the invention resides in an antigen-binding molecule that is 
immuno-interactive with a polypeptide, fragment, variant or derivative as broadly 
described above. 

10 In yet another aspect, the invention provides a method to prepare a polynucleotide 

encoding a modified PKS. comprising using an albicidin PKS-NRPS encoding nucleotide 
sequence as a scaffold and modifying the portions of the nucleotide sequence that encode 
enzymatic activities, either by mutagenesis, inactivation, deletion, insertion, or 
replacement. 

15 In still yet anotlier aspect, the invention contemplates a method for producing 

polyketides, comprising expressing the modified albicidin PKS encoding nucleotide 
sequence as broadly described in a suitable host cell to thereby produce a polyketide 
different from that produced by the albicidin PKS-NRPS. 

Another aspect of the mvention contemplates the insertion of portions of the 
20 albicidin PKS-NRPS coding sequence into other PKS coding sequences to modify the 
products thereof. 

In a further aspect, the invention encompasses use of the polypeptide, fragment, 
variant or derivative as broadly described above, or the polynucleotide or vector as broadly 
described above, or the modulatory agent as broadly described above for producing 
25 secondary metabolites, preferably albicidins. 



wo 02/24736 



PCT/AUOl/01190 



-20- 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic representation showing a physical and functional map of 
part of the albicidin biosynthetic gene cluster. (A). Partial physical map of the Tn5 
insertion locus in LSI 57 genomic DNA. Restriction enzymes used: C, Clal; E, EcdRl] S, 
5 Spel] N, Notl'y and B, BamBI, (B). Probes used to recover clone pXABB: Probe 1, 1.4-kb 
EcoRl-Notl fragment digested from pBC157; and probe 2, 0.9-kb PGR product amplified 
from Xal3 genomic DNA using primers complementary to sequences flanking the Tn5 
insertion in LSI 57. (C). Clones and subclones used for sequencing, and described in Table 
1. (D), The transcription directions of three putative ORFs in 16.5-kb EcoRl fragment are 

10 indicated by arrows. (E). Organisation of Z albilineans XabB constructed by comparison 
with known protein sequences. The unshaded box indicates PKS region, and the shade box 
indicates NRPS region. Relative positions of potential catalytic domains or active sites are 
indicated by: AL, acyl-CoA hgase; ACP, acyl carrier protein; KS, /3-ketoacyl synthase; 
KR, jS-ketoacyl reductase; PGP, peptidyl carrier protein; C, condensation; A, adenylation. 

1 5 Horizontal bars indicate proposed biosynthetic modules. 

Figure 2 is a diagrammatic representation presenting the sequence of the region 
upstream from xabB, The nucleotide sequence is nimibered according to the 1651 1-bp 
sequence in GenBank accession no. AF239749. The putative -35 and -10 promoter 
sequences of xabB and the divergent gene xatA are underlined, as are ribosome-bindmg 
20 sequences. The transcriptional directions of xabB and xatA are indicated by arrows. 
Translational start codons are indicated by boldface type. Primers PlFl and PIR are 
shaded. 

Figure 3 is a diagrammatic representation showing the alignment ofX. albilineans 
XabB enzymatic domains with those of PKSs and FASs from other organisms. Identical 

25 amino acids are indicated by boldface type. Stars and overlines identify conserved amino 
acids at catalytic sites. Xal-XabB, X, albilineans XabB for biosynthesis of albicidin (this 
study); Hin-LCFA, Haemophilus influenza long-cham fatty acid-CoA ligase (P46450); 
Bsu-PksJ, 5. subtilis polyketide synthase J (P40806); Bsu-MycA, B. subtilis MycA for 
biosynthesis of mycosubtilin (AFl 84956); Pcr-ComL2, Petroselinum crispum 4- 

30 coumarate-CoA Ugase 2 (P14913); Sma-FkbB, S, sp. MA6548 FkbB for biosyntiiesis of 
FK506 (AF082099); Ame-RifA, Amycolatopsis mediterranei RifA for biosynthesis of 
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rifamycin B (AF040570); Shy-RapA, S, hygroscopicus RapA for biosynthesis of 
rapamycin (X86780); Mxa-Tal, M xanthus Tal for biosynthesis of TA (AJ006977); Ser- 
EryAl and EryA3, S, erythraea EryA modules for biosynthesis of erythromycin (M63676, 
M63677); Che-PKSl, Cochliobohis heterostrophus PKSl for biosynthesis of T-toxin 
5 (U68040); Bsu-PksM, 5. subtilis PKS for a polyketide synthase (031781); Mtu-PpsA, M. 
tiiberadosis PKS for a polyketide synthase (G3261605); Mtu-MAS, M tuberculosis MAS 
for biosynthesis of mycocerosic acid (M95808); Chick-FAS, chichen fatty acid synthase 
(M22987); Rat-FAS, rat fatty acid synthase (X14175). 

Figure 4 is a graphical representation showing albicidin production by wild-type 
10 X. albilineans LS155 (A), complemented Tox mutant strain LS157 pLXABBl (O), 
complemented Tox mutant strain LS157 pLXABB2 (•), LS157 (■), and LS157 
pLAFR3 (+). Albicidin concentrations in culture supematants were quantified based on 
inhibition zone width in a microbial bioassay (means +/- standard errors from 5 repUcates). 

Figure 5 is a graphical representation showing the relationship between growth 
15 (■), albicidin production (O), and GUS activity (A) in X, albilineans LS155 pRG960pl 
(A) and in LSI 55 pRG960p2 (B). Relative activity (means +/- standard errors from 2 
replicates): 100% growth, OD550 = 1.43; 100% albicidin production = 268.5 units/ml; 
100% GUS activity = 119 units/mg of protein (one unit equals 1 pmol of 
methylumbelliferone fomied per mm.). Locations and sizes of inserts on pRG960pl and 
20 pRG960p2 are indicated in Figure 2 and Table 1 . GUS, ^-glucuronidase. 

Figure 6 is a schematic representation showing the organisation of five known 
PKS-NRPS enzymes. X. albilineans XabB, encoded by xabB for albicidin biosynthesis 
(this study); 5. subtilis MycA for mycosubtiUn biosynthesis (Duitman et al, 1999); 
Yersinia pestis HMWPl for yersiniabactin biosynthesis (Gehring et aLy 1998); M xanthus 
25 partial gene product Tal for TA biosynthesis (Paitan et al, 1999); 5. subtilis PksorfX6 for 
unknown function (Albertini et al, 1995). Unshaded boxes indicate PKS regions, grey 
boxes indicate NRPS regions, and dark boxes indicate amino transferase (AMT) or 
methyltransferase (MT). Vertical bars follow the carrier domains at the end of each 
biosynthetic "module". 

30 Figure 7 is a diagrammatic representation showmg a dendrogram (GCG) analysis 

of adenylation domains of XabB and its homologous peptide synthetases. Peptide 
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synthetases, including various modules of the same multienzyme complex, are as follows: 
GrsA and GrsB, gramicidin synthetase A and B, respectively, from B. subtilis (XI 5577, 
X61658); BacA, BacB, and BacC, bacitracin synthetase A, B, and C, respectively, from B, 
licheniformis (AF007865); SnbC and SnbDE, pristinamycin I synthetase C and DE, 
5 respectively, from 5. pristinaespiralis (X98690, Y11547); FkbP, FK506 synthetase FkbP 
from S, sp. MA6548 (AF082100); TycA, TycB, and TycC, tyrocidine synthetase A, B, and 
C, respectively, from 5. brevis (AF004835); SyrE, syringomycin synthetase El from 
Pseudomonas syringae pv. syringae (AF047828); EntF, enterobactin synthetase F from E. 
coli (PI 1454); DhbF, 2,3-dihydroxybenzoate synthetase F from B, subtilis (P45745); 
10 FenD, fengycin synthetase FenDl from 5. subtilis (AJOl 1849); SrfAA, SrfAB, and SrfAC, 
surfactin A synthetase A, B, and C, respectively, from 5. subtilus (X70356); XabB, 
albicidin synthase B from X albilineans (this study). The A4 to A5 regions (about 100 aa) 
of adenylation domains of peptide synthetases, which is involved in amino acid recognition 
and binding, were aligned using the PILEUP program with default parameters. 

15 Figure 8 is a diagrammatic representation showing a restriction map of clones 

including the xabA gene from X, albilineans. Sequencing by primer walking commenced at 
the T3 and T7 primers. The location and direction of transcription of the xabA ORF is 
shown by an arrow. Restriction enzymes are: E, Ec6Sl\ P, Pstl\ C, C/al; and H, HinSSl 

Figure 9 is a diagrammatic representation presenting the sequence of the xabA 
20 gene. The nucleotide sequence is numbered according to the 3-kb sequence in GenBank 
accession no. AF191324. The closest matches to RBS region and promoter consensus 
sequences are underlined, as are the region of dyad symmetry and putative factor- 
independent termination sites. Translation start and stop codons are indicated by boldface 
type. The (V/I)G(V/I)D and (FAV)(S/C/T)xKE(A/S)xxK motifs conserved in PPTase 
25 enzymes are boxed. The insertion site of Tn5 is marked (T). 

Figure 10 is a graphical representation showing albicidin production by wild-type 
X, albilineans strahi Xal3 (O), Xal3 pLXABA (•), and complemented Tox" mutant strain 
LSI 56 pLXABA (A). Albicidm concentrations in culture supematants were quantified 
based on inhibition zone width in a microbial bioassay (means +/- standard errors from 2 
30 replicates). 



wo 02/24736 



PCT/AlJOl/01190 



-23- 

Figure 11 is a schematic representation showing a dendrogram (GCG) analysis of 
PPTases involved in antibiotic and fatty acid biosynthesis in bacteria. Sau, Salmonella 
austin\ Sty, Salmonella typhymurium; Bbr, Bacillus brevis; Xal, Xanthomonas albilineam; 
Eco, Escherichia coli; Sfl, Shigella flexneri; Bpu, Bacillus pumilus; Bsu, Bacillus subtilis; 
5 Mtu, Mycobacterium tuberculosis; Hin, Haemophilus influenzae. The sources of amino 
acid sequence of PPTases correspond to those in Table 2, and the sequences v^ere aligned 
using the PELEUP program v^ith default parameters. 

Figure 12 is a schematic representation showing the organisation of part of the 
albicidin biosynthetic gene cluster. The location and direction of three ORFs are indicated 

10 by tliick anows. Vertical lines indicate the position of restriction enzyme sites: E, Eco"Sl\ 
B, 5amHI; S, iS^el; N, Ncol. The vertical lines vidth triangles (-^) show the position of 
insertional mutagenesis sites or Tn5 insertion site, and the resultant mutants are bracketed. 
The arrows above the physical map indicate the locations of primers used to amplify 
sequence downstream of the £coRI restriction site by IPCR. The cloned regions for 

1 5 complementation tests are shown below the map . 

Figure 13 is a diagrammatic representation presenting the nucleotide and deduced 
amino acid sequences of the xabC region. The nucleotide sequence is numbered according 
to the 1515-bp sequence in GenBank accession no. AF239750. The potential RBS and 
selected restriction sites are underlined. The putative factor-independent termination 
20 signals are underlined and indicated by bold letters. Translation start and stop codons are 
indicated by bold letters. The conserved motifs in Mtases are boxed. Primers used for PGR 
(A3F and A3R) and IPCR (IR) are shaded. 

Figure 14 is a diagranmiatic representation showing the conserved sequence 
motifs in Mtases involved in antibiotic biosynthesis in bacteria. Identical or similar amino 

25 acids (A = G; D = E; I = L = V) are shown in bold. Numbers indicate amino acid residues 
from the N terminus of the protein. Xal-XabC, putative albicidin biosynthesis Mtase from 
X, albilineans (this study); Sgl-TcmO and Sgl-TcmN, multifunctional cyclase-dehydrase- 
3-O-Mtase and tetracenomycin polyketide syntiaesis 8-0-Mtase of Streptomyces 
glaucescens, respectively (accession number M80674); Smy-MdmC, midecamycin-O- 

30 Mtase of S, mycarofaciens (M93958); Mxa-SafC, saframycin 0-Mtase of Myxococcus 
xanthus (U24657); Ser-EryG, erythromycin biosynthesis 0-Mtase of Saccharopolyspora 
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erythraea (S18533); Spe-DauK, carminomycin 4-0-Mtase from S. peucetius (L13453); 
Sal-DmpM. 0-demethylpuromycin-O-Mtase from S. alboniger (M74560); Shy-RapM, 
rapamycin 0-Mtase of ^. hygroscopicus (X86780); Sav-AveD. avennectin B 5-6?-Mtase 
from S. avemiUilis (G5921167). 

Figure 15 is a graphical representation showing albicidin production by wild-type 
X. albilineans LS155 (•), Tox xabC insertion mutant LS-JP2 (■). complemented strain 
LS-JP2 pLXABC containing Lac promoter- foil length xabC gene (O), and complemented 
strain LS-JP2 pLXABBl containing foil length xabB plus fonctional N-terminal region of 
xabC (TI). Albicidin concentrations in culture supematants were quantified based on 
inhibition zone width in a microbial bioassay (means +/- standard errors from 2 or 3 
replicates). 
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BRIEF DESCRIPTION OF THE SEQUENCES: SUMMARY TABLE 



TABLE A 



SEQUENCE ID 
NUMBER 


SEQUENCE 




SEQIDNO:! 


Full-length (Accession No. AF239749) 


loDDl bases 


SEQIDN0:2 


Full-length polypeptide sequence encoded by SEQ 
ID NO: 1 


4801 residues 


SEQIDNO: 3 


Full-length coding sequence of xabB 


14406 bases 


SEQ ID NO: 4 


Polypeptide sequence encoded by SEQ ID NO: 3 


4801 residues 


SEQ ID NO: 5 


Sub-sequence oi ocX^ UJ jnu. i ana d encouing acyi- 
CoA ligase subdomain I 




SEQ YD NO: 6 


Acyl-CoA ligase subdomain I encoaea by ohQ IJJ 
NO: 5 


1 ^ /111 

ij resiaucs 


SEQ ID NO: 7 


Sub-sequence oi bbQ UD JNU. l ana 5 encoaing acyi- 
CoA ligase subdomain n 


Z*+ DcloCa 


SEQ ID NO: 8 


Acyl-CoA ligase subaomain 1 encoaea by oHQ UJ 
NO: 7 


o resiuucs 


SEQ ID NO: 9 


bub-sequence oi hhKl JJJ iNU. i ana j encoaing p- 
ketoacyl synthase 1 subdomain I 




SEQIDNO: 10 


|3-Ketoacyl synthase 1 subdomain I encoded by SEQ 
ID NO: 9 


1 / resiuuca 


bJaQ i-U JNU: 11 


oUD-sequence oi onv^ xu i aiiu o cuvuunig fj 
ketoacyl synthase 1 subdomain U 


'^0 bases 


SEQIDNO: 12 


j3-Ketoacyl synthase 1 subdomain U encoded by SEQ 
ID NO: 11 


10 residues 


SEQIDNO: 13 


Sub-sequence of SEQ ID NO: 1 and 3 encoding )3- 
ketoacyl synthase 1 subdomain EQ 


30 bases 


SEQIDNO: 14 


j3-Ketoacyl synthase 1 subdomain III encoded by 
SEQ ID NO: 13 


10 residues 


SEQIDNO; 15 


Sub-sequence of SEQ ID NO: 1 and 3 encoding jS- 
ketoacyl synthase 2 subdomain I 


51 bases 
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SEQUENCE ID 
NUMBER 


. SEQUENCE ^ 


LENGTH 


SEQIDNO: 16 


jS-Ketoacyl synthase 2 subdomain I encoded by SEQ 
ID NO: 15 


17 residues 


SEQIDNO: 17 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 0- 
ketoacyl synthase 2 subdomain n 


30 bases 


SEQIDNO: 18 


j3-Ketoacyl synthase 2 subdomain n encoded by SEQ 
ID NO: 17 


10 residues 


SEQIDNO: 19 


Sub-sequence of SEQ ID NO: 1 and 3 encoding /3- 
ketoacyl synthase 2 subdomain III 


30 bases 


SEQIDNO: 20 


j8-Ketoacyl synthase 2 subdomain HI encoded by 
SEQIDNO: 19 


10 residues 


SEQIDNO: 21 


Sub-sequence of SEQ ID NO: 1 and 3 encoding ^- 
ketoacyl reductase domain 


93 bases 


SEQIDNO: 22 


i3-Ketoacyl reductase domain encoded by SEQ ID 
NO: 21 


31 residues 


SEQIDNO: 23 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl 
carrier protein 1 domain 


36 bases 


SEQIDNO: 24 


Acyl carrier protein 1 domain encoded by SEQ ID 
NO: 23 


12 residues 


SEQIDNO: 25 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl 
carrier protein 2 domain 


36 bases 


SEQIDNO: 26 


Acyl carrier protein 2 domain encoded by SEQ ID 
NO: 25 


12 residues 


SEQIDNO: 27 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl 
carrier protein 3 domain 


36 bases 


SEQ ID NO: 28 


Acyl carrier protein 3 domain encoded by SEQ ID 
NO: 27 


12 residues 


SEQIDNO: 29 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain I 


18 bases 


SEQ ID NO: 30 


Adenylation domain subdomain I encoded by SEQ 
ID NO: 29 


6 residues 


SEQ ID NO: 31 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain n 


33 bases 
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SEQUENCE ID 
NUMBER. 


SEQUENCE . 


LENGTH 


SEQIDNO: 32 


Adenylation domain subdomain n encoded by SEQ 
ID NO: 31 


1 1 residues 


SEQ ID NO: 33 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain HI 


48 bases 


SEQ ID NO: 34 


Adenylation domain subdomain EI encoded by SEQ 
ID NO: 33 


16 residues 


SEQIDNO: 35 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain IV 


12 bases 


SEQ IDNO: 36 


Adenylation domain subdomain IV encoded by SEQ 
ID NO: 35 


4 residues 


SEQIDNO: 37 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain V 


21 bases 


SEQ ID NO: 38 


Adenylation domain subdomain V encoded by SEQ 
ID NO: 37 


7 residues 


SEQ ID NO: 39 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain VI 


45 bases 


SEQ ID NO: 40 


Adenylation domain subdomain VI encoded by SEQ 
ID NO: 39 


15 residues 


SEQ ID NO: 41 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain Vn 


18 bases 


SEQ ID NO: 42 


Adenylation domain subdomain Vn encoded by SEQ 
ID NO: 41 


6 residues 


SEQ ID NO: 43 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain VIII 


60 bases 


SEQ ID NO: 44 


Adenylation domain subdomain Vm encoded by 
SEQIDNO: 43 


20 residues 


SEQIDNO: 45 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain IX 


21 bases 


SEQ m NO: 46 


Adenylation domain subdomain IX encoded by SEQ 
ID NO: 45 


7 residues 


SEQ ID NO: 47 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain X 


18 bases 
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SEQUENCE ID 
NUMBER 


SEQUENCE . 


LENGTH 


SEQE)NO: 48 


Adenylation domain subdomain X encoded by SEQ 
ID NO: 47 


6 residues 


SEQIDNO: 49 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
peptidyl carrier protein 1 domain 


33 bases 


SEQIDNO: 50 


Peptidyl carrier protein 1 domain encoded by SEQ 
ED NO: 49 


1 1 residues 


SEQIDNO: 51 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
peptidyl carrier protein 2 domain 


33 bases 


SEQIDNO: 52 


Peptidyl carrier protein 2 domain encoded by SEQ 
ID NO: 51 


11 residues 


SEQIDNO: 53 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain I 


30 bases 


SEQIDNO: 54 


Condensation domain 1 subdomain I encoded by 
SEQIDNO: 53 


10 residues 


SEQIDNO: 55 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain n 


27 bases 


SEQIDNO: 56 


Condensation domain 1 subdomain n encoded by 
SEQ ID NO: 55 


9 residues 


SEQIDNO: 57 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain HI 


30 bases 


SEQIDNO: 58 


Condensation domain 1 subdomain m encoded by 
SEQ ID NO: 57 


10 residues 


SEQIDNO: 59 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain IV 


21 bases 


SEQIDNO: 60 


Condensation domain 1 subdomain IV encoded by 
SEQIDNO; 59 


7 residues 


SEQIDNO: 61 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain V 


36 bases 


SEQIDNO: 62 


Condensation domain 1 subdomain V encoded by 
SEQ ID NO: 61 


12 residues 


SEQIDNO: 63 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain VI 


21 bases 
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SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 


SEQroNO: 64 


Condensation domain 1 subdomain VI encoded by 
SEQroNO: 63 


7 residues 


SEQIDNO: 65 


Sub-sequence of SEQ ro NO: 1 and 3 encoding 
condensation domain 1 subdomain Vn 


24 bases 


SEQIDNO: 66 


Condensation domain 1 subdomain VU encoded by 
SEQroNO: 65 


8 residues 


SEQ ED NO: 67 


Sub-sequence of SEQ ro NO: 1 and 3 encoding 
condensation domain 2 subdomain I 


30 bases 


SEQ ID NO: 68 


Condensation domain 2 subdomain I encoded by 
SEQroNO: 67 


10 residues 


SEQroNO: 69 


Sub-sequence of SEQ ro NO: 1 and 3 encoding 
condensation domain 2 subdomain II 


27 bases 


SEQ ro NO: 70 


Condensation domain 2 subdomain n encoded by 
SEQroNO: 69 


9 residues 


SEQroNO: 71 


Sub-sequence of SEQ ro NO: 1 and 3 encoding 
condensation domain 2 subdomain En 


30 bases 


SEQ ro NO: 72 


Condensation domain 2 subdomain in encoded by 
SEQroNO: 71 


10 residues 


SEQroNO: 73 


Sub-sequence of SEQ ro NO: I and 3 encoding 
condensation domain 2 subdomain IV 


21 bases 


SEQroNO: 74 


Condensation domain 2 subdomain IV encoded by 
SEQroNO: 73 


7 residues 


SEQroNO: 75 


Sub-sequence of SEQ ro NO: 1 and 3 encoding 
condensation domain 2 subdomain V 


33 bases 


SEQroNO: 76 


Condensation domain 2 subdomain V encoded by 
SEQroNO: 75 


1 1 residues 


SEQ ro NO: 77 


Sub-sequence of SEQ ro NO: 1 and 3 encoding 
condensation domain 2 subdomain VI 


21 bases 


SEQroNO: 78 


Condensation domain 2 subdomain VI encoded by 
SEQroNO: 77 


7 residues 


SEQroNO: 79 


Sub-sequence of SEQ ro NO: 1 and 3 encoding 
condensation domain 2 subdomain VII 


24 bases 
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SEQUENCE ID 
^fUMBER 


SEQUENCE. ' 


LENGTH 


SEOIDNO: 80 


Condensation domain 2 subdomain VII encoded by 
SEQIDNO: 79 


8 residues 


SEQBDNO: 81 


Polynucleotide comprising xahB promoter 


242 bases 


SEQIDNO: 82 


Full-length xaM (Accession No. AF191324) 


1200 bases 


SEQ ID NO: 83 


Full-length polypeptide sequence encoded by SEQ 
ID NO: 82 


278 residues 


SEQIDNO: 84 


Full-length coding sequence of xabA 


837 bases 


SEQIDNO: 85 


Polypeptide sequence encoded by SEQ ID NO: 84 


278 residues 


SEQIDNO: 86 


Sub-sequence of SEQ ID NO: 82 encoding PPTase 
domain 


168 bases 


SEQIDNO: 87 


PPTase domain encoded by SEQ ID NO: 86 


56 residues 


SEQIDNO: 88 


Sub-sequence of SEQ ID NO: 82 encoding a motii 
(motif ]5 conserved in PPTases 


2/ bases 


SEQIDNO: 89 


PPTase motif I amino acid sequence encoded by 
SEQIDNO: 88 


9 residues 


SEQIDNO: 90 


Sub-sequence of SEQ ID NO: 82 encoding 
intervening amino acid sequence linking motifs I and 

n 


117 bases 


SEQIDNO: 91 


Intervening amino acid sequence encoded by SEQ ID 
NO: 90 


39 residues 


SEQIDNO: 92 


Sub-sequence of SEQ ID NO: 82 encoding a motif 
(motif n) conserved in PPTases 


36 bases 


SEQIDNO: 93 


PPTase motif n amino acid sequence encoded by 
SEQIDNO: 92 


12 residues 


SEQIDNO: 94 


Full-length xabC (Accession No. AF239750) 


1515 bases 


SEQIDNO: 95 


Full-length polypeptide sequence encoded by SEQ 
ID NO: 94 


343 residues 


SEQIDNO: 96 


Full-length coding sequence of xabC 


1029 bases 


SEQIDNO: 97 


Polypeptide sequence encoded by SEQ ID NO: 96 


343 residues 
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SEQUENCE ID 
NUMBER 


SEQUENCE . 


LENGTH 


SEQIDNO:98 


Sub-sequence of SEQ ID NO: 94 encoding a motif 
(motif JO conserved in methyltransferases 


21 bases 


SEQIDNO:99 


Methyltransferase motif I amino acid sequence 
encoded by SEQ ID NO: 98 


7 residues 


SEQIDNO: 100 


Sub-sequence of SEQ ID NO: 94 encoding a motif 
(motif n) conserved in methyltransferases 


24 bases 


SEQIDNO: 101 


Methyltransferase motif H amino acid sequence 
encoded by SEQ ID NO : 1 00 


8 residues 


SEOIDNO- 102 


Sub-sequence of SEQ ED NO: 94 encoding a motif 
(motif ni) conserved in methyltransferases 


27 bases 


SEQIDNO: 103 


Methyltransferase motif EI amino acid sequence 
encoded by SEQ ID NO: 102 


9 residues 


SEQIDNO: 104 


Polynucleotide encoding said motifs I, n and m 


303 bases 


SEQIDNO: 105 


Polypeptide encoded by SEQ ID NO: 104 


101 residues 


SEQIDNO: 106 


Biologically active fragment of SEQ ID NO: 94 


831 bases 


SEQIDNO: 107 


Biologically active fragment of SEQ ID NO: 95 


277 residues 
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DETABLED DESCRIPTION OF THE INVENTION 
L Definitions 

Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood by those of ordinary skill in the art to which the 
5 invention belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, preferred 
methods and materials are described. For the purposes of the present invention, the 
following terms are defined below. 

The articles "a" and are used herein to refer to one or to more than one (i.e. 
10 to at least one) of the grammatical object of the article. By way of example, "an element" 
means one element or more than one element. 

The term ''about" is used herein to refer to sequences that vary by as much as 
30%, preferably by as much as 20% and more preferably by as much as 10% to the length 
of a reference sequence. 

15 By "agent*' is meant a naturally occurring or synthetically produced molecule 

which interacts either directly or indirectly with a target member, the level and/or 
functional activity of which are to be modulated. 

''Amplification product" refers to a nucleic acid product generated by nucleic acid 
amplification techniques. 

20 By "antigen-binding molecule " is meant a molecule that has binding affinity for a 

target antigen. It will be understood that this term extends to immunoglobulins, 
immunoglobulin fragments and non-immunoglobulin derived protein frameworks that 
exhibit antigen-binding activity. 

As used herein, the term "binds specifically" and the like refers to antigen- 
25 binding molecules that bind the polypeptide or polypeptide fragments of the invention but 
do not significantly bind to homologous prior art polypeptides. 
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By ''biologically active Jragment" is meant a fragment of a full-length parent 
polypeptide which fragment retains the activity of the parent polypeptide. A biologically 
active fragment will therefore comprise an activity selected form the group consistmg of 
acyl-CoA Ugase activity, /3-ketoacyl synthase activity, jS-ketoacyl reductase, acyl carrier 

5 protein activity, adenylation activity, peptidyl carrier protein activity, condensation 
activity, PPTase activity and methyltransferase activity. As used herein, the term 
''biologically active fragment" includes deletion mutants and small peptides, for example 
of at least 10, preferably at least 20 and more preferably at least 30 contiguous amino 
acids, which comprise the above activities. Peptides of this type may be obtained through 

10 the application of standard recombinant nucleic acid techniques or synthesised using 
conventional liquid or solid phase synthesis techniques. For example, reference may be 
made to solution synthesis or solid phase synthesis as described, for example, in Chapter 9 
entitled "Peptide Synthesis'* by Atherton and Shephard which is included in a publication 
entitled ''Synthetic Vaccines" edited by Nicholson and published by Blackwell Scientific 

15 Publications. Alternatively, peptides can be produced by digestion of a polypeptide of the 
invention with proteinases such as endoLys-C, endoArg-C, endoGlu-C and staphylococcus 
V8-protease. The digested fragments can be purified by, for example, high performance 
liquid chromatographic (HPLC) techniques. 

Throughout this specification, unless the context requires otherwise, the words 
20 "comprise "comprises " and "comprising" will be understood to imply the inclusion of a 
stated step or element or group of steps or elements but not the exclusion of any other step 
or element or group of steps or elements. 

By "'corresponds td' or '"corresponding to'' is meant a polynucleotide (a) having a 
nucleotide sequence that is substantially identical or complementary to all or a portion of a 
25 reference polynucleotide sequence or (b) encoding an amino acid sequence identical to an 
amino acid sequence in a peptide or protein. This phrase also includes within its scope a 
peptide or polypeptide having an amino acid sequence that is substantially identical to a 
sequence of amino acids in a reference peptide or protein. 

By "derivative" is meant a polypeptide that has been derived from the basic 
30 sequence by modification, for example by conjugation or complexing with other chemical 
moieties or by post-translational modification techniques as would be understood in the art. 
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The term "derivative" also includes within its scope alterations that have been made to a 
parent sequence including additions, or deletions that provide for fiinctionally equivalent 
molecules. Accordingly, the term derivative encompasses molecules that will have an 
activity selected form the group consisting of acyl-CoA ligase activity, /3-ketoacyl synthase 
5 activity, /3-ketoacyl reductase, acyl carrier protein activity, adenylation activity, peptidyl 
carrier protein activity, condensation activity, PPTase activity and methyltransferase 
activity. 

"Homology" refers to the percentage number of amino acids that are identical or 
constitute conservative substitutions as defined in Table B infra. Homology may be 
10 determmed using sequence comparison programs such as GAP (Deveraux et al. 1984, 
Nucleic Acids Research 12, 387-395). In this way, sequences of a similar or substantially 
different length to those cited herein might be compared by insertion of gaps into the 
alignment, such gaps being determined, for example, by the comparison algorithm used by 
GAP. 

1 5 "Hybridisation" is used herein to denote the pairing of complementary nucleotide 

sequences to produce a DNA-DNA hybrid or a DNA-KNA hybrid. Complementary base 
sequences are those sequences that are related by the base-pairing rules. In DNA, A pairs 
with T and C pairs with G. In RNA U pairs with A and C pairs with G. In this regard, the 
terms "match" and "mismatch" as used herein refer to the hybridisation potential of paired 

20 nucleotides in complementary nucleic acid strands. Matched nucleotides hybridise 
efficiently, such as the classical A-T and G-C base pair mentioned above. Mismatches are 
other combinations of nucleotides that do not hybridise efficiently. 

Reference herein to "immuno-interactive" includes reference to any interaction, 
reaction, or other form of association between molecules and in particular where one of the 
25 molecules is, or mimics, a component of the immune system. 

By "immuno-interactive fragment" is meant a firagment of a parent or reference 
polypeptide as described herein, which firagment eUcits an immune response, including the 
production of elements that specifically bind to said polypeptide, or variant or derivative 
thereof As used herein, the term "immuno-interactive fragment " includes deletion mutants 
30 and small peptides, for example of at least six, preferably at least 8 and more preferably at 
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least 20 contiguous amino acids, which comprise antigenic determinants or epitopes. 
Several such fragments may be jomed together. 

By ''isolated" is meant material that is substantially or essentially free from 
components that normally accompany it in its native state. For example, an "isolated 
5 polynucleotide", as used herein, refers to a polynucleotide, which has been purified from 
the sequences which flank it m a naturally occurring state, e,g,, a DNA fragment which has 
been removed from the sequences which are normally adjacent to the fragment. 

By "modulating** is meant increasing or decreasing, either directly or indirectly, 
the level and/or functional activity of a target molecule. For example, an agent may 
10 indirectly modulate the said level/activity by interacting with a molecule other than the 
target molecule. In this regard, indirect modulation of a gene encoding a target polypeptide 
includes within its scope modulation of the expression of a first nucleic acid molecule, 
wherein an expression product of the first nucleic acid molecule modulates the expression 
of a nucleic acid molecule encoding the target polypeptide. 

15 By obtained from" is meant that a sample such as, for example, a nucleic acid 

extract or polypeptide extract is isolated from, or derived from, a particular source. For 
example, the extract may be isolated directly from any organism that produces secondary 
metabolites, preferably from an albicidm-producing microorganism, more preferably from 
microorganisms of the gQWS Xanthomonas. 

20 The term ^'oligonucleotide'' as used herein refers to a polymer composed of a 

multiplicity of nucleotide units (deoxyribonucleotides or ribonucleotides, or related 
structural variants or synthetic analogues thereof) linked via phosphodiester bonds (or 
related structural variants or synthetic analogues thereof). Thus, while the term 
"oligonucleotide" typically refers to a nucleotide polymer in which the nucleotides and 

25 linkages between them are naturally occurring, it will be understood that the term also 
includes within its scope various analogues including, but not restricted to, peptide nucleic 
acids (PNAs), phosphoramidates, phosphorothioates, methyl phosphonates, 2-0-methyl 
ribonucleic acids, and the Uke. The exact size of the molecule may vary depending on the 
particular application. An oligonucleotide is typically rather short in length, generally from 

30 about 10 to 30 nucleotides, but the term can refer to molecules of any length, although the 
term "polynucleotide" or "nucleic acid" is typically used for large oligonucleotides. 
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By ''operabfy linked" is meant that transcriptional and translational regulatory 
nucleic acids are positioned relative to a polypeptide-encoding polynucleotide in such a 
manner that the polynucleotide is transcribed and the polypeptide is translated. 

The term polynucleotide" or ''nucleic acid"" as used herein designates mRNA, 
5 RNA, cRNA, cDNA or DNA. The term typically refers to oUgonucleotides greater than 30 
nucleotides in length. 

The terms polynucleotide variant" and ^'variant" refer to polynucleotides 
displaying substantial sequence identity with a reference polynucleotide sequence or 
polynucleotides that hybridise with a reference sequence under stringent conditions that are 

10 defined heremafter. These terms also encompass polynucleotides in which one or more 
nucleotides have been added or deleted, or replaced with different nucleotides, hi this 
regard, it is well understood in the art that certain alterations inclusive of mutations, 
additions, deletions and substitutions can be made to a reference polynucleotide whereby 
the altered polynucleotide retains the biological function or activity of the reference 

15 polynucleotide. The terms "polynucleotide variant" and ''variant" also include naturally 
occurring allelic variants. 

''Polypeptide'', peptide'' and protein" are used mterchangeably herein to refer to 
a polymer of amino acid residues and to variants and synthetic analogues of the same. 
Thus, these terms apply to amino acid polymers in which one or more amino acid residues 
20 is a synthetic non-naturally occurring amino acid, such as a chemical analogue of a 
con-esponding naturally occurring amino acid, as well as to naturally-occurring amino acid 
polymers. 

The term polypeptide variant" refers to polypeptides in which one or more 
amino acids have been replaced by different amino acids. It is well xmderstood in the art 

25 that some amino acids may be changed to others with broadly similar properties without 
changing the nature of the activity of the polypeptide (conservative substitutions) as 
described hereinafter. These terms also encompass polypeptides in which one or more 
amino acids have been added or deleted, or replaced with different amino acids. 
Accordingly, polypeptide variants as used herein encompass polypeptides that have an 

30 activity selected form the group consisting of acyl-CoA Ugase activity, /3-ketoacyl synthase 
activity, i3-ketoacyl reductase, acyl carrier protein activity, adenylation activity, peptidyl 
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carrier protein activity, condensation activity, PPTase activity and methyltransferase 
activity. 

By ''primer'' is meant an oligonucleotide which, when paired with a strand of 
DNA, is capable of initiating the synthesis of a primer extension product in the presence of 
5 a suitable polymerising agent. The primer is preferably single-stranded for maximum 
efficiency in amplification but may alternatively be double-stranded. A primer must be 
sufficiently long to prime the synthesis of extension products in the presence of the 
polymerisation agent. The length of the primer depends on many factors, includmg 
application, temperature to be employed, template reaction conditions, other reagents, and 

10 source of primers. For example, depending on the complexity of the target sequence, the 
oligonucleotide primer typically contains 15 to 35 or more nucleotides, althougji it may 
contain fewer nucleotides. Primers can be large polynucleotides, such as firom about 200 
nucleotides to several kilobases or more. Primers may be selected to be "substantially 
complementary" to the sequence on the template to which it is designed to hybridise and 

1 5 serve as a site for the initiation of synthesis. By "substantially complementary", it is meant 
that the primer is sufficiently complementary to hybridise with a target nucleotide 
sequence. Preferably, the primer contains no mismatches with the template to which it is 
designed to hybridise but this is not essential. For example, non-complementary 
nucleotides may be attached to the 5' end of the primer, with the remainder of the primer 

20 sequence being complementary to the template. Alternatively, non-complementary 
nucleotides or a stretch of non-complementary nucleotides can be interspersed into a 
primer, provided that the primer sequence has sufficient complementarity with the 
sequence of the template to hybridise therewith and thereby form a template for synthesis 
of the extension product of the primer. 

25 "Probe " refers to a molecule that binds to a specific sequence or sub-sequence or 

other moiety of another molecule. Unless otherwise indicated, the term "probe" typically 
refers to a polynucleotide probe that binds to another nucleic acid, often called the "target 
nucleic acid", through complementary base pairing. Probes may bind target nucleic acids 
lacking complete sequence complementarity with the probe, depending on the stringency 

30 of the hybridisation conditions. Probes can be labelled directly or indirectly. 
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The term "recombinant polynucleotide" as used herein refers to a polynucleotide 
formed in vitro by the manipulation of nucleic acid into a form not normally found in 
nature. For example, the recombinant polynucleotide may be in the form of an expression 
vector. Generally, such expression vectors include transcriptional and translational 
5 regulatory nucleic acid operably linked to the nucleotide sequence. - 

By "recombinant polypeptide" is meant a polypeptide made usmg recombinant 
techniques, i.e., through the expression of a recombinant polynucleotide. 

By "reporter molecule" as used in the present specification is meant a molecule 
that, by its chemical nature, provides an analytically identifiable signal that allows the 
10 detection of a complex comprising an antigen-binding molecule and its target antigen. The 
term "reporter molecule" also extends to use of cell agglutination or inhibition of 
agglutination such as red blood cells on latex beads, and the like. 

Terms used to describe sequence relationships between two or more 
polynucleotides or polypeptides include "reference sequence", "comparison window", 

15 "sequence identity", "percentage of sequence identity" and "substantial identity". A 
"reference sequence" is at least 12 but frequently 15 to 18 and often at least 25 monomer 
units, inclusive of nucleotides and amino acid residues, in length. Because two 
polynucleotides may each comprise (1) a sequence {i.e., only a portion of the complete 
polynucleotide sequence) that is similar between the two polynucleotides, and (2) a 

20 sequence that is divergent between the two polynucleotides, sequence comparisons 
between two (or more) polynucleotides are typically performed by comparing sequences of 
the two polynucleotides over a "comparison window" to identify and compare local 
regions of sequence similarity. A "comparison window" refers to a conceptual segment of 
at least 6 contiguous positions, usually about 50 to about 100, more usually about 100 to 

25 about 150 in which a sequence is compared to a reference sequence of tiie same number of 
contiguous positions after the two sequences are optimally aUgned. The comparison 
window may comprise additions or deletions (i.e., gaps) of about 20% or less as compared 
to the reference sequence (which does not comprise additions or deletions) for optimal 
aUgnment of the two sequences. Optimal alignment of sequences for aUgning a comparison 

30 window may be conducted by computerised implementations of algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsm Genetics Software Package Release 
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7.0, Genetics Computer Group, 575 Science Drive Madison, WI, USA) or by inspection 
and the best alignment resulting in the highest percentage homology over the 
comparison window) generated by any of the various methods selected. Reference also 
may be made to the BLAST family of programs as for example disclosed by Altschul et 
5 al, 1997, Niicl Acids Res. 25:3389. A detailed discussion of sequence analysis can be 
found in Unit 19.3 of Ausubel et al, "Current Protocols in Molecular Biolog/', John 
Wiley & Sons Inc, 1994-1998, Chapter 15. 

The term ''sequence identity*' as used herein refers to the extent fliat sequences 
are identical on a nucleotide-by-nucleotide basis or an amino acid-by-amino acid basis 

10 over a window of comparison. Thus, a ''percentage of sequence identity" is calculated by 
comparing two optimally aUgned sequences over the window of comparison, determining 
the number of positions at which the identical nucleic acid base (e.g.. A, T, C, G, I) or the 
identical amino acid residue (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, He, Phe, Tyr, Trp, Lys, 
Arg, His, Asp, Glu, Asn, Gbi, Cys and Met) occurs in both sequences to yield the number 

15 of matched positions, dividing the number of matched positions by the total number of 
positions in the window of comparison (/.e., the window size), and multiplying the result 
by 100 to yield the percentage of sequence identity. For the purposes of the present 
invention, ''sequence identity" will be understood to mean the "match percentage" 
calculated by the DNASIS computer program (Version 2.5 for windows; available from 

20 Hitachi Software engineering Co., Ltd., South San Francisco, CaUfomia, USA) using 
standard defaults as used in the reference manual accompanying the software. 

"Stringency" as used herem, refers to the temperature and ionic strength 
conditions, and presence or absence of certain organic solvents, during hybridisation and 
washing procedures. The higher the stringency, the higher will be the degree of 
25 complementarity between immobiUsed target nucleotide sequences and the labelled probe 
polynucleotide sequences that remain hybridised to the target after washing. 

"Stringent conditions" refers to temperature and ionic conditions xmder which 
only nucleotide sequences having a high frequency of complementary bases v^U hybridise. 
The stringency required is nucleotide sequence dependent and depends upon the various 
30 components present during hybridisation and subsequent washes, and the time allowed for 
these processes. Generally, in order to maximise the hybridisation rate, non-stringent 
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hybridisation conditions are selected; about 20 to 25 °C lower than the thermal melting 
point (Tm). The Tm is the temperature at which 50% of specific target sequence hybridises 
to a perfectiy complementary probe in solution at a defined ionic strength and pH. 
Generally, in order to require at least about 85% nucleotide complementarity of hybridised 
5 sequences, highly stringent washing conditions are selected to be about 5 to 15 ''C lower 
than the Tm. In order to require at least about 70% nucleotide complementarity of 
hybridised sequences, moderately stringent washing conditions are selected to l e about 15 
to 30 °C lower than the Tm. Highly permissive (low stringency) washing conditions may be 
as low as 50 °C below the Tm, allowing a high level of mis-matching between hybridised 
10 sequences. Those skilled in the art will recognise that other physical and chemical 
parameters in the hybridisation and wash stages can also be altered to affect the outcome of 
a detectable hybridisation signal from a specific level of homology between target and 
probe sequences. Other examples of stringency conditions are described in section 3.3. 

By "vector" is meant a nucleic acid molecule, preferably a DNA molecule 

15 derived, for example, from a plasmid, bacteriophage, or plant virus, into which a nucleic 
acid sequence may be inserted or cloned. A vector preferably contains one or more unique 
restriction sites and may be capable of autonomous replication in a defined host cell 
including a target cell or tissue or a progenitor cell or tissue thereof, or be integrable with 
the genome of the defined host such that tiie cloned sequence is reproducible. Accordingly, 

20 the vector may be an autonomously replicating vector, i.e., a vector that exists as an 
extrachromosomal entity, tiie replication of which is independent of chromosomal 
replication, e.g., a linear or closed circular plasmid, an extrachromosomal element, a 
minichromosome, or an artificial chromosome. The vector may contain any means for 
assuring self-replication. Altematively, the vector maybe one which, when introduced into 

25 a cell, is integrated into the genome of the recipient cell and replicated together with the 
chromosome(s) into which it has been integrated. A vector system may comprise a single 
vector or plasmid, two or more vectors or plasmids, which together contain the total DNA 
to be introduced into the genome of the host cell, or a transposon. The choice of the vector 
will typically depend on the compatibility of the vector with the cell into which the vector 

30 is to be introduced. The vector may also include a selection marker such as an antibiotic 
resistance gene that can be used for selection of suitable transformants. Examples of such 
resistance genes are well known to those of skill in the art. 
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As used herein, underscoring or italicising the name of a gene shaU indicate the 
gene, in contrast to its protein product, which is indicated by the name of the gene in the 
absence of any underscoring or italicising. For example, "xabB" shall mean the xabB gene, 
whereas "XabB" shall indicate the protem product of the "xabB'' gene. 

5 2. Isolated polypeptides, biologically active fragments, polypeptide variants and 
derivatives 

2.1 Polvpeptides of the invention 

2.1.1 Albicidin synthetase 

The present inventor has also isolated a gene {xabB) encoding a large modular 

1 0 polyketide synthase (PBCS) linked to a non-ribosomal peptide synthetase (NRPS) (predicted 
Mr 525,695). At 4801 amino acids in length, the product of xabB (XabB) is the largest 
reported PKS-NRPS. Comparison of XabB with available protein sequence databases 
reveals an N-terminal region (from Met-1 to Asp-3235) similar to many microbial modular 
PKSs, and a C-terminal region (from Pro-3236 to Asp-4801) similar to NRPSs. 

15 Recognisable PKS domams commencing at the N-terminus of XabB, are an acyl-CoA 
ligase (AL), acyl carrier protein (ACPI), /3-ketoacyl synthase (KSl), and ^-ketoacyl 
reductase (KR), followed by two consecutive ACPs and one KS (Figure 1). The motifs 
characteristic of these domains are aligned with those from other organisms in Figure 3. 
The AL domain shows 22-30% identity and 50-60% similarity to prokaryotic and 

20 eukaryotic aromatic acid-CoA ligases and long-chain fatty acid-CoA ligases, and contains 
the conserved adenylation core sequence (SGSSG) and the ATPase motif (TGD). The 
three ACP domains show up to 39.2% identity and 78.6% similarity to acyl carrier 
proteins, and all contam a 4'-phosphopantetheinyl binding cofactor box GxDS(l/L) 
(Hopwood and Sherman, 1990), except that A replaces G in ACPI (Figure 3). The two KS 

25 domains show up to 56.1% identity and 80.8% sunilarity to jS-ketoacyl synthases. Both 
contain motif GPxxxxxxxCSxSL around the active site Cys, and two His residues 
downstream of the active site Cys, in motifs characteristic of these enzymes (Donadio et 
al, 1991; Hopwood, 1997; Huang et al, 1998). The KR domain shows up to 27.9% 
identity and 61.8% similarity to /3-ketoacyl reductases, and contains the NAD(P)H binding 

30 site GGxGxLG (Scrutton et al., 1990). 
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At the C-terminus of XabB is an apparent peptide synthetase region linked to the 
PKS module via a peptidyl carrier protein (PCP) domain (Figure 1). The peptide synthetase 
region shows 31-38% identity and 60-63% similarity with members of the peptide 
synthetase family. It displays the ordered condensation, adenylation, and PCP domains 
5 typical of such multienzymes (Marahiel et al, 1997) followed by an extra condensation 
domain. The conserved sequences, characteristic of the domains commonly found in 
peptide synthetases, are compared with those firom XabB in Table 2. 

In more detail, the full-length amino acid sequence of the X, albilineans PKS- 
NRPS, presented in SEQ ID NO: 2, extends 4801 residues and includes the following 
10 sequence signature motifs: 

(a) acyl-CoA ligase (AL) motif I extending from about residue 226 to about residue 
240, and motif 11 extending from about residue 486 to about residue 493; 

(b) jS-ketoacyl synthase 1 (KSl) motif I extending from about residue 897 to about 
residue 913, motif n extending from about residue 1038 to about residue 1047, and 

15 motif in extending from about residue 1080 to about residue 1089; 

(c) jS-ketoacyl synthase 2 (KS2) motif I extending from about residue 2777 to about 
residue 2793, motif 11 extending from about residue 2918 to about residue 2927, and 
motif m extending from about residue 2955 to about residue 2964; 

(d) jS-ketoacyl reductase (KR) motif extending from about residue 1812 to about 
20 residue 1842; 

(e) acyl carrier protein 1 (ACPI) motif extending from about residue 667 to about 
residue 678; 

(f) acyl carrier protein 2 (ACP2) motif extending from about residue 2484 to about 
residue 2495; 

25 (g) acyl carrier protein 3 (ACP3) motif extending from about residue 2568 to about 

residue 2579; 

(h) adenylation domain (A) motif I extending from about residue 3806 to about 
residue 3811, motif n extending from about residue 3851 to about residue 3861, motif 
m extending from about residue 3917 to about residue 3932; motif IV extending from 
30 about residue 3967 to about residue 3970, motif V extending from about residue 4063 to 
about residue 4069, motif VI extending from about residue 4114 to about residue 4128, 
motif Vn extending from about residue 4152 to about residue 4157, motif Vin 
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extending from about residue 4170 to about residue 4189, motif DC extending from 
about residue 4239 to about residue 4245, and motif X extending from about residue 
4259 to about residue 4264; 

(i) peptidyl carrier protein 1 (PCPl) motif extending from about residue 3261 to 
5 about residue 327 1 ; 

(j) peptidyl carrier protein 2 (PCP2) motif extending from about residue 4306 to 
about residue 43 16; 

(k) condensation domain 1 (CI) motif I extending from about residue 3333 to about 
residue 3342, motif n extending from about residue 3381 to about residue 3389, and 

10 motif HI extending from about residue 3456 to about residue 3465, motif IV extending 
from about residue 3495 to about residue 3501, motif V extending from about residue 
3606 to about residue 3617, motif VI extending from about residue 3641 to about 
residue 3647, motif Vn extending from about residue 3658 to about residue 3665; and 
(1) condensation domain 2 (C2) motif I extending from about residue 4374 to about 

15 residue 4383, motif II extending from about residue 4421 to about residue 4429, and 
motif in extending from about residue 4498 to about residue 4507, motif IV extending 
from about residue 4538 to about residue 4544, motif V extending from about residue 
4649 to about residue 4659, motif VI extending from about residue 4685 to about 
residue 4691, motif Vn extending from about residue 4701 to about residue 4708. 

20 From the above signature motifs, it can be deduced that XabB commences with an 

AL domain (residues 1-629) followed by an ACP domain (ACPI, residues 630-731). In 
other PKS systems, an N-terminal AL is involved in activation and incorporat^.on of 3,4- 
dihydroxycyclohexane carboxylic acid, 3 -amino-5 -hydroxy benzoic acid (AHBA), or long- 
chain fatty acid as a starter (Aparicio et al, 1996; Motamedi and Shafiee, 1998; Tang et 

25 a/., 1998; Duitman et al, 1999). The second module in XabB contains a KS (residues 732- 
1165), and a KR (residues 1811-1971) upstream of two ACPs (residues 2457-2522, 2544- 
2613), but lacks any discemable AT domain (Figure 1). The third module contains a KS 
(residues 2630-3046) followed by a PCP (residues 3221-3307) at the start of the XabB 
NRPS region. 

30 Four other fused PKS/NRPS systems (Albertini et al, 1995; Gehring et al, 1998; 

Duitman et al, 1999; Paitan et al, 1999) are known, three of which lack recognisable AT 
domains (Figure 6). Yersinia pestis HMWPl contains a typical PKS elongation module 
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(including AT), and an NRPS module with a terminating TE domain. It is the third protein, 
following an AL (YbtE) and NRPS (HMWP2) in the biosynthetic apparatus for 
yersiniabactin (Geliring et al, 1998). B, subtilis MycA bears the closest resemblance to 
XabB, showing PKS initiation and elongation modules linked via an amino transferase 
5 (AMT) domain to the NRPS region. In B, subtilis PksK and M xanthus Tal, the NRPS 
region precedes the PKS region. Separate AT enzymes encoded elsewhere in the genome 
may operate in trans to load the appropriate acyl groups onto the ACPs in the elongation 
modules of these PKSs. Candidates are a malonyl-CoA tranascylase gene (fenF) located 
immediately upstream oimycA (Duitman et al, 1999), and an acyltransferase gene located 
10 20 kb upstream of tal (Paitan et aL, 1999). Accordingly, it is beheved that one or more 
such trans-^ctixig AT enzymes may also be involved in connection with the operation of 
XabB. 

From the characteristics of albicidin, and the architecture of the XabB PKS region 
(Figure 1), the inventor considers that: (i) the AL couples coenzyme A to a shikimate- 

15 derived acyl residue in an ATP-dependent reaction, and loads the activated acyl unit onto 
the 4'-phosphopantetheine prosthetic arm of ACPI; (ii) an acyl group is loaded onto ACP2 
or ACPS by a separate acyltransferase; (iii) the KSl domain accepts the acyl residue from 
ACPI onto a conserved cysteine residue, then transfers it by decarboxylative condensation 
onto the acyl group tethered to ACP2 or ACPS; (iv) the tethered chain is modified by KR; 

20 (v) the assembled polyketide intermediate is translocated via KS2 onto the 4- 
phosphopantetheine prosthetic arm of PCPl, at the start of the NRPS region. 

The A domain in the NRPS region of XabB contains ten conserved sequences (Al 
to AlO, Table 2) identified as AMP, ATP-Mg binding, adenine binding or ATPase sites 
(Turgay et aly 1992; Marahiel et a/., 1997). In other NRPS systems, A domains select and 

25 load a particular amino acid, nonproteinogenic amino, hydroxyl or carboxy acid (Marahiel 
et al, 1997). Substrate specificity is determined at the binding pocket, consisting of a 
stretch of about 100 amino acid residues between highly conserved motif ^.4 and A5 
(Conti et al, 1997). Sequence alignments for this region reveal some clusters 
corresponding with the loaded substrate (Stachelhaus et al, 1999). The A donain from 

30 XabB falls in a diverse cluster of NRPS modules involved in loading of His, Leu or 
aromatic amino acids (Phe and Tyr) in other NRPS systems (Figure 7). Bared on the 
architecture of the XabB NPRS region, it can be inferred that the polyketide intermediate 
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tethered on PCPl is accepted by CI and coupled to the amino, hydroxyl, or carboxy acid 
preloaded by A onto PCP2. The final condensation domain at the C-tenninus of XabB is 
probably involved in peptide-chain termination and cyclisation, as in enniatin, HC-toxin, 
rapamycin and FK506 systems (Konz and Marahiel, 1999). 

5 2,L2 Phosphopantetheinyl transferase associated with albicidin biosynthesis 

The present invention also provides a gene {xabA) bomX, albilineans encoding a 
phosphopantetheinyl transferase (PPTase) associated with XabB function. In this regard, 
XabB contains five carrier protein (ACP/PCP) domains, to which the growing polyketide 
or polypeptide chain could be covalently tethered. Each fimctional ACP or PCP domain 
10 must have a specific serine side chain phosphopantetheinylated by a dedicated PPTase 
(Lambalot et al, 1996). The product ofxabA (XabA) fiilfils this fimction and is required 
for post-translational activation of synthetases in the albicidin biosynthetic pathway. 

The fiill-length amino acid sequence of this X. albilineans PPTase, presented in 
SEQ ID NO: 83, extends 278 residues and includes the sequence signature motifs for 

1 5 PPTases which are located as follows: (I) motif I spanning firom about residue 1 59 to about 
residue 167; and (II) motif II spanning firom about residue 207 to about residue 218, of 
SEQ ID NO: 83. The sequence intervening between the two motifs extends fi*om about 
residue 168 to about residue 206 of SEQ ID NO: 83. These conserved sequence motifs and 
the intervening sequence are presented for convenience in SEQ ID NO: 89, 93 and 91, 

20 respectively. 

The deduced xabA gene product has 56-62 % overall similarity to EntD proteins 
for enterobactin biosynthesis and 39-56 % overall similarity to other enzymes in the 
phosphopantetheinyl transferase superfamily. Like entD, xabA includes rarely used codons, 
which may impose post-transcriptional control on the rate of gene product formation 
25 (Coderre & Earhart, 1989). Codon optimisation of xabA may, therefore, be usefiil for 
enhancing the production of XabA. 

1L3 Methyltransferase associated with albicidin biosynthesis 

The invention also provides a gene (^xabC) fi:om X. albilineans encoding a 
methyltransferase enzyme, more particularly an O-methyltransferase enzyme, which is 
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required for albicidin production and which when expressed above natural levels leads to 
increased levels and/or functional activities of albicidin antibiotics. The full-length amino 
acid sequence of this X. albilineans methyltransferase, presented in SEQ ID NO: 95, 
extends 343 residues and includes methyltransferase consensus sequence motifs which are 
5 located as follows: (I) motif I spanning from about residue 173 to about residue 180; (II) 
motif n spanning from about residue 236 to about residue 243; and (UT) motif m spanning 
from about residue 266 to about residue 274, of SEQ ID NO: 95. These conserved 
sequence motifs are presented for convenience in SEQ ID NO: 99, 101 and 103, 
respectively. 

10 2.2 Bioloeicallv active fragments 

The invention also contemplates biological fragments of the above polypeptides 
of at least 6 and preferably at least 8 amino acids in length, which comprise an activity 
associated with the domains described above. For example, biologically active fragments 
may be produced accordmg to any suitable procedure known in the art. For example, a 

15 suitable method may include fu-st producing a fragment of a parent polypeptide as 
described in Section 2.1 and then testing the fragment for the appropriate biological 
activity. In one embodiment, the fragment is derived from the albicidin PKS-NRPS of the 
invention and is tested for an activity selected form the group consisting of acyl-CoA 
ligase activity, jS-ketoacyl synthase activity, jS-ketoacyl reductase, acyl carrier protein 

20 activity, adenylation activity, peptidyl carrier protein activity and condensation activity. 

Any assays that detects or preferably measure such activities is contemplated in 
the practice of the present invention. The biologically active fragment suitably comprises 
any one or more of the sequence signature motifs described above, or variants thereof. 
Preferably, the biologically active fragment comprises all said sequence signature motifs, 
25 or variants thereof 

In another embodiment, the fragment is derived from the PPTase of the invention 
and is tested for PPTase activity according to standard assays known to personr of skill in 
the art. Suitably, the PPTase catalyses the pantetheinylation, more preferably the 
phosphopantetheinylation, of proteins involved in antibiotic biosynthesis, preferably 
30 albicidin biosynthesis. The biologically active fragment preferably comprises the 
consensus sequence motifs set forth in SEQ ID NO: 89 and 93, or variant thereof and thus. 
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more preferably comprises the sequence from about residue 159 to about residue 218, of 
SEQIDNO:83. 

In yet another embodiment, the fragment is derived from the methyltrarisferase of 
the invention and is tested for methyltransferase activity, preferably 0-methyltransferase 
5 activity and more preferably 5-adenosylmethionine-dependent 0-methyltransferase 
activity. Suitably, the methyltransferase catalyses the transfer of one or more methyl 
groups to an antibiotic precursor, more preferably an albicidin precursor or an intermediate 
relating to the biosynthesis of albicidins. The biologically active fragment preferably 
comprises the consensus sequence motifs set forth in SEQ ID NO: 99, 101 and 103, or 

10 variant thereof and thus, more preferably comprises the sequence from about residue 173 
to about residue 274 of SEQ ID NO: 95 (Le., SEQ ID NO: 105), or variant of said 
sequence. In an especially preferred embodiment, the biologically active fragment 
comprises the sequence from about residue 1 to about residue 277 of SEQ ID NO: 95 {i.e., 
SEQ ID NO: 107), or variant of said sequence. An exemplary polynucleotide encoding this 

1 5 sequence is cloned in vector pLXABB described infra. 

Alternatively, biological activity of the fragment is tested by introducing a 
polynucleotide from which a fragment of a parent polypeptide can be translated into a cell, 
and detecting one or more of the above activities, which is indicative of said fragment 
being a biologically active fragment. In one embodiment, such activity can be assayed by 
20 introducing into an albicidin deficient xabB' X. albilineans mutant (e.g., strain LSI 57 
described herein) a polynucleotide from which a PKS-NRPS-associated fragment can be 
produced and assaying for antibiotic activity using a microbial plate assay, as for instance 
described in Example 1. 

In another embodunent embodiment, PPTase activity is assayed by introducing 
25 into an albicidin deficient xaWZ albilineans mutant (e.g., strain LS156 described herein) 
a polynucleotide from which a PPTase-associated fragment can be produced and assaying 
for antibiotic activity using a microbial plate assay, as for instance described in Example 2. 

In yet another embodiment, methyltransferase activity is assayed by introducing 
into an albicidin deficient xabC X. albilineans mutant (e.g., strain LS-JPl described 
30 herein) a polynucleotide from which a methyltransferase-associated fragment can be 
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produced and assaying for antibiotic activity as for example described herein using a 
microbial plate assay, as for instance described in Example 3. 



2.3 Polypeptide variants 

The invention also contemplates polypeptide variants of the polypeptides of the 
5 invention wherein said variants have an activity selected form the group consisting of acyl- 
CoA hgase activity, 0-ketoacyl synthase activity, jS-ketoacyl reductase, acyl carrier protein 
activity, adenylation activity, peptidyl carrier protein activity, condensation activity, 
PPTase activity, and methyltransferase activity. Suitable methods of producing polypeptide 
variants include, for example, producing a modified polypeptide whose sequence is 
- 10 distinguished from a parent polypeptide as described in Section 2.1 or a biologically active 
fragment thereof by the substitution, deletion and/or addition of at least one amino acid. 
The modified polypeptide is then tested for one or more of said activities, wherein the 
presence of that activity indicates that the modified polypeptide is a variant of the parent 
polypeptide. 

15 In another embodiment, a polypeptide variant is produced by introducing into a 

cell a polynucleotide from which a modified polypeptide can be translated, and detecting 
one or more of the activities described above that are associated with the cell, which is 
indicative of the modified polypeptide being a polypeptide variant. 

In general, variants will have at least 60%, more suitably at least 70%, preferably 
20 at least 80%, and more preferably at least 90% homology to a polypeptide as for example 
shown in SEQ ID NO: 4, or a biological fragment thereof. It is preferred that variants 
display at least 60%, more suitably at least 70%, preferably at least 75%, more preferably 
at least 80%, more preferably at least 85%, more preferably at least 90% and still more 
preferably at least 95% sequence identity with a parent polypeptide as described in Section 
25 2.1 or a biologically active fragment thereof In this respect, the window of comparison 
preferably spans about the fiill length of the polypeptide or of the biologically active 
fragment. Suitable variants can be obtained from any secondary metabolite-producing 
organism, and preferably from an albicidin-producing organism. 

Alternatively polypeptide variants according to the invention can be identified 
30 either rationally, or via estabUshed methods of mutagenesis (see, for example, Watson, J. 
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D. et al, 'MOLECULAR BIOLOGY OF THE GENE", Foxirth Edition, 
Benjamin/Curtimings, Menlo Park, Calif., 1987). SignijBcantly, a random mutagenesis 
approach requires no a priori information about the gene sequence that is to be mutated. 
This approach has the advantage that it assesses the desirability of a particular mutant 
5 based on its function, and thus does not require an understanding of how or why the 
resultant mutant protein has adopted a particular conformation. Indeed, the random 
mutation of target gene sequences has been one approach used to obtain mutant proteins 
having desired characteristics (Leatherbarrow, R. 1986, ProL Eng. 1: 7-16; Knowles, J. 
R., 1987, Science 236: 1252-1258; Shaw, W. V., 1987, Biochem. J. 246: 1-17; Gerit, L A. 
10 1987, Che?n. Rev. 87: 1079-1105). Alternatively, where a particular sequence alteration is 
desired, methods of site-directed mutagenesis can be employed. Thus, such methods may 
be used to selectively alter only those amino acids of the protein that are believed to be 
important (Craik, C. S., 1985, Science 228: 291-297; Cronin, et al, 1988, Biochem. 11: 
4572-4579; Wilks, et al, 1988, Science 242: 1541-1544). 

1 5 Variant peptides or polypeptides, resulting from rational or established methods of 

mutagenesis or from combinatorial chemistries may comprise conservative amino acid 
substitutions. Exemplary conservative substitutions in a polypeptide or polypeptide 
fragment according to the invention may be made according to the foUovnng table: 

TABLES 



Original Residue 


Exemplary Substitutions 


Ala 


Ser 


Arg 


Lys 


Asn 


Ghi, His 


Asp 


Glu 


Cys 


Ser 


Ghi 


Asn 


Glu 


Asp 


Gly 


Pro 
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Original Residue 


jcjcepipiury oiiosniuuujis 


His 


Asn, CjIh 


lie 


Leu, Val 


Leu 


lie, Val 


Lys 


Arg, Cjin, Olu 


Met 


LrCU, lie. 


Phe 


Metj Leu, lyr 




Thr 


■ -Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp, Phe 


Val 


He, Leu 



Substantial changes in function are made by selecting substitutions that are less 
conservative than those shown in TABLE B. Other replacements would be non- 
conservative substitutions and relatively fewer of these may be tolerated. Generally, the 

5 substitutions which are likely to produce the greatest changes in a polypeptide's properties 
are those in which (a) a hydrophiUc residue (e.g., Ser or Asn) is substituted for, or by, a 
hydrophobic residue (e.g., Ala, Leu, lie, Phe or Val); (b) a cysteine or prohne is substituted 
for, or by, any other residue; (c) a residue having an electropositive side chain (e.g., Arg, 
His or Lys) is substituted for, or by, an electronegative residue (e.g., Glu or Asp) or (d) a 

10 residue having a smaller side chain (e.g., Ala, Ser) or no side chain (e.g., Gly) is 
substituted for, or by, one having a bulky side chain (e.g., Phe or Trp). 

2.4 Polypeptide derivatives 

A polypeptide can typically tolerate one or more amino acid deletions and 
insertions in its amino acid sequence without loss or significant loss of a desired activity, 
15 Accordingly, the invention also contemplates derivatives of the parent polypeptides of the 
invention described in Section 2.1 or biologically active firagments thereof or -ariants of 
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these, which include amino acid deletions and/or additions, wherein said derivatives 
comprise one or more activities selected form the group consisting of acyl-CoA Ugase 
activity, /3-ketoacyl synthase activity, |3-ketoacyl reductase, acyl carrier protein activity, 
adenylation activity, peptidyl carrier protein activity, condensation activity. PPTase 
5 activity and methyltransferase activity associated with antibiotic biosynthesis, and 
preferably with albicidin biosynthesis. 

Preferred derivatives of the invention include PKS-NRPS molecules with altered 
activities in one or more respects and thus produce polyketides other than the albicidin 
natural product(s) of the XabB. A PKS-NRPS derived from XabB by such alteration 

10 includes a modular PKS-NRPS (or its corresponding encoding gene(s)) that retains the 
scaffolding of the utiUsed portion encoded by the naturaUy occurring gene. Not all domains 
or modules need be altered. On the constant scaffold, at least one enzymatic activity is 
mutated, deleted, replaced, or inserted so as to alter the activity of the resulting PKS-NRPS 
relative to the original or parent PKS-NRPS. Alteration results when these activities are 

1 5 deleted or are replaced by a different version of the activity, or simply mutated in such a 
way that a polyketide other than the natural product results from these collective activities. 
This occurs because there has been a resulting alteration of the starter unit and/or 
elongation unit, stereochemistry, chain length or cycUsation, and/or reductive or 
dehydration cycle outcome at a corresponding position in the product polyketide. Where a 

20 deleted activity is replaced, the origin of the replacement activity may come from a 
corresponding activity in a different naturally occurring PKS or PKS-NRPS or from a 
different region of the albicidin PKS-NRPS. Any or all PKS/NRPS genes may be included 
in the derivative or portions of any of these may be included, but the scaffolding of the 
albicidin PKS-NRPS protein is preferably retained in whatever derivative is constructed. 

25 Thus, a PKS-NRPS derived from the albicidin PKS-NRPS includes a PKS-NRPS 

that contains the scaffolding of all or a portion of XabB. The derived PKS-NRPS also 
contains at least two elongation modules that are fimctional and preferably at least three 
elongation modules. The derived PKS-NRPS also contains mutations, deletions, insertions, 
or replacements of one or more of the activities of the fimctional domams or modules of 

30 XabB so that the nature of the resulting polyketide is altered. Exemplary embodiments 
include those wherein a KS or ACP domain has been deleted or replaced by a version of 
the activity from a different PKS/NRPS or from another location within XabB. Also 
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contemplated are derivatives where at least one non-condensation cycle enzymatic activity 
(KR, KR, or A) has been deleted or added or wherein any of these activities has been 
mutated so as to change the structure of the polyketide synthesised by the PKS. 

Other derivatives contemplated by the present invention include fusion of the 
5 polypeptides, fragments and polypeptide variants of the invention with other polypeptides 
or proteins. For example, it will be appreciated that said polypeptides, fragments or 
variants may be incorporated into larger polypeptides, and that such larger polypeptides 
may also be expected to have one or more of the activities mentioned above. The 
polypeptides, fragments or variants of the invention may be fused to a further protein, for 
10 example, which is not derived from the original host. The flirther protem may assist in the 
purification of the fusion protein. For instance, a polyhistidine tag or a maltose binding 
protein may be used in this respect as described in more detail below. Other poss- ble fusion 
proteins are those which produce an immunomodulatory response. Particular examples of 
such proteins include Protein A or glutathione S-transferase (GST). 

15 Other derivatives contemplated by the invention include, but are not limited to, 

modification to side chains, incorporation of unnatural amino acids and/or their derivatives 
during peptide, polypeptide or protein synthesis and the use of crosslinkers and other 
methods which impose conformational constraints on the polypeptides, fragments and 
variants of the invention. Examples of side chain modifications contemplated by the 

20 present invention include modifications of amino groups such as by acylation with acetic 
anhydride; acylation of amino groups with succinic anhydride and tetrahydrophthalic 
anhydride; amidination with methylacetimidate; carbamoylation of amino groups with 
cyanate; pyridoxylation of lysine with pyridoxal-5-phosphate followed by reduction with 
NaBHU; reductive alkylation by reaction with an aldehyde followed by reduction with 

25 NaBEU; and trinitrobenzylation of amino groups with 2, 4, 6-trinitrobenzene sulphonic acid 
(TNBS). The carboxyl group may be modified by carbodiiirdde activation via O- 
acylisourea formation followed by subsequent derivatisation, by way of example, to a 
corresponding amide. The guanidine group of arginine residues may be modified by 
formation of heterocyclic condensation products with reagents such as 2,3-butanedione, 

30 phenylglyoxal and glyoxal. Sulphydryl groups may be modified by methods such as 
performic acid oxidation to cysteic acid; formation of mercurial derivatives using 4- 
chloromercuriphenylsulphonic acid, 4-chloromercuribenzoate; 2-chloromercuri-4- 
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nitrophenol, phenylmercury chloride, and other mercurials; formation of a mixed 
disulphides with other thiol compounds; reaction with maleimide, maleic anhydride or 
other substituted maleimide; carboxymethylation with iodpacetic acid or iodoacetamide; 
and carbamoylation with cyanate at alkaline pH. Tryptophan residues may be modified, for 
5 example, by alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide or 
sulphonyl halides or by oxidation with N-bromosuccinimide. Tyrosine residues may be 
modified by nitration witii tetranitromethane to foma a 3-nitrotyrosine derivative. The 
imidazole ring of a histidine residue may be modified by N-carbethoxylation with 
diethylpyrocarbonate or by alkylation with iodoacetic acid derivatives. 

10 Examples of incorporating unnatural amino acids and derivatives during peptide 

synthesis mclude but are not Umited to, use of 4-amino butyric acid, 6-aminohexanoic acid, 
4-amino-3-hydroxy-5-phenylpentanoic acid, 4-amino-3-hydroxy-6-methylheptanoic acid, 
t-butylglycine, norleucine, norvaline, phenylglycine, ornithine, sarcosine, 2-thienyl alanine 
and/or D-isomers of amino acids. A list of unnatural amino acids contemplated by the 

1 5 present invention is shown in TABLE C. 



TABLE C 



Non-conveiitidnal amino acid ^ - 


■Non-conventw^^ , 


a-aminobutyric acid 


L-N-methylalanine 


a-amino-a-methylbutyrate 


L-N-methylarginine 


aminocyclopropane-carboxylate 


L-N-methylasparagine 


aminoisobutyric acid 


L-N-methylaspartic acid 


aminonorbomyl-carboxylate 


L-N-methylcysteine 


cyclohexylalanine 


L-N-methylglutamine 


cyclopentylalanine 


L-N-methylglutamic acid 


L-N-methylisoleucine 


L-N-methylhistidine 


D-alanine 


L-N-methylleucine 


D-arginine 


L-N-methyllysine 


D-aspartic acid 


L-N-methyhnethionine 
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Non-conventional ammo acid 


Non-conventional amino acid 


J. " 

D-cysteine 


Lr JN -rneinyiiiorieuciiie 


D-glutamate 


L-N-methylnorvaline 


D-glutamic acid 


1^- JN -mc luyiorniininc 


D-histidine 


LfVi -mcinyipncnyiaidLUiic 


D-isoIeucine 


L-N-methylproline 


D-leucine 


j-(-iN -incQiyiscriiic 


D-lysine 


i^-jN -incinyunreonine 


D-methionine 


L-N-methyltryptophan 


D-omithine 


L-N-methyltyrosme 


D-phenylalanine 


JL-iN -mcinyivaiinc 


D-proline 


L-N-methylethylglycine 


D-serine 


L-N-methyl-t-butylglycine 


D-threonine 


L-norleucine 


D-tryptophan 


L-norvaline 


D-tyrosine 


of-methyl- amuioi sobutyrat © 


D-valine 


u-inciny 1-7^" snLiiii 


D-a-methylalanine 


Of-ineiiiyicycioncxyiaianine 


D-a-methylarginine 


o^ineinyicyicopeniyiaianine 


D-of-methylasparagine 


Orin euiyi -u-nap inyi aiamii c 


D-a-methylaspartate 


of-methylp enicillamiiie 


D-Of-methylcysteine 


IN - ainino D uiy ly giy t/iiic 


D-a-methylglutamine 


N-(2-aminoethyl)glycine 


D-a-methylhistidine 


N-(3-ainmopropyl)glycine 


D-a-methylisoleucine 


N-amino-a-methylbutyrate 


D-a-methylleucine 


a-napthylalanine 



wo 02/24736 



PCT/AUOl/01190 



-55- 



Non-conventional amino acid 


jy07l~CUrlVCilllUflCll Uf flit 11/ C*u*U 


D-a-methyllysine 


IN -ucnzyigiy t/iiic 


D-CMnethylmethionine 




D-a-methylomithiine 


"M /'pQrViQmvlmpfllvl^P^lvfil'ne 

IN -^c aru dJiiy JiJic uj,y 1^ gi y iiit^ 


D-a-methylphenylalanine 


"M ^>Qt4lr\YvptV^v^^^y1v^^^^e 
IN-^^Z-CdiuOAycuiyi/giy^iiic 


D-a-methylproline 


JN-^carDOxyincuj.yi^giy wiiic 


D-a-inethylserine 


IN - cy CIO u u ly igiy ^iiiw 


D-a-methylthreonine 


XT r»\/f»1r»Vipr\tvlcy1\/Pinft 
IN -Ly i/iuiicp ly igiy ^i-Liw 


D-a-methyltryptophan 


iN-cycionvAyigiywuc 


D-a-methyltyrosine 


IN -cyciocicuyigiy v/iiic 


L-a-methylleucine 


j_>-Cc-i]icinyiiysinc 


L-a-methylmethionine 


j^-OriiiciQyinori e uomc 


L-a-methylnorvatine 


Xj- Of-ine inyionu Lniiio 


L-a-methylphenylalanine 


L-a-methylproline 


L-a-methylserine 


L-a-methylthreonine 


T -rv-mptVi vltTVntnnliaTl 


L-a-raethyltyrosine 


L-a-methylvaline 


L-N-methylhomophenylalanine 


N-(N-(2,2-diphenylethyl 
carbamylinetliyl)glycine 


N-(N-(3,3-diphenylpropyl 
carbamylmethyl)glycine 


1 -caiboxy- l-(2,2-diphenyl-ethyl 
amino)cyclopropane 





Also contemplated is the use of crosslinkers, for example, to sterilise 3D 
conformations of the polypeptides, iOragments or variants of the invention, using homo- 
bifiinctional cross linkers such as bifimctional imido esters having (CH2)n spacer groups 
5 with n = 1 to n = 6, glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctional 
reagents which usually contain an amino-reactive moiety such as N-hydroxysuccinimide 
and another group specific-reactive moiety such as maleunido or dithio moiety or 
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carbodiimide. In addition, peptides can be confonnationally constrained, for example, by 
introduction of double bonds between Ca and atoms of amino acids, by incorporation of 
Ca and N(rmethylamino acids, and by formation of cyclic peptides or analogues by 
introducing covalent bonds such as forming an amide bond between the N and C termini 
5 between two side chains or between a side chain and the N or C terminus of the peptides or 
analogues. For example, reference may be made to: Marlowe (1993, Biorganic & 
Medicinal Chemistry Letters 3: 437-44) who describes peptide cyclisation on TFA resin 
using trimethylsilyl (TMSE) ester as an orthogonal protecting group; Pallin and Tarn 
(1995, y. Chem, Soa Chem. Comm, 2021^2022) who describe the cyclisation of 

10 improtected peptides in aqueous solution by oxime formation; Algin et al (1994, 
Tetrahedron Letters 35: 9633-9636) who disclose solid-phase synthesis of head-to-tail 
cycHc peptides via lysine side-chain anchoring; Kates et al (1993, Tetrahedron Letters 34: 
1549-1552) who describe the production of head-to-tail cyclic peptides by three- 
dimensional solid phase strategy; Tumelty et al (1994, 7. Chenu Soc. Chem, Comm, 1067- 

15 1068) who describe the synthesis of cyclic peptides jfrom an immobilised activated 
intermediate, wherein activation of the immobilised peptide is carried out with N- 
protecting group intact and subsequent removal leadmg to cyclisation; McMruray et al 
(1994, Peptide Research 7: 195-206) who disclose head-to-tail cycUsation of peptides 
attached to insoluble supports by means of the side chains of aspartic and glutamic acid; 

20 Hruby et al (1994, Reactive Polymers 22: 231-241) who teach an alternate method for 
cyclising peptides via solid supports; and Schmidt and Langer (1997, J. Peptide Res, 49: 
67-73) who disclose a method for synthesising cyclotetrapeptides and cyclopentapeptides. 
The foregoing methods may be used to produce confonnationally constrained polypeptides 
that comprise one or more activities selected form the group consisting of acyl-CoA ligase 

25 activity, jS-ketoacyl synthase activity, jS-ketoacyl reductase, acyl carrier protem activity, 
adenylation activity, peptidyl carrier protein activity, condensation activity, PPTase 
activity and methyltransferase activity associated with the production of polyketides and 
particularly albicidins or analogues thereof 

The invention also contemplates polypeptides, fragments or variants of the 
30 invention that have been modified using ordinary molecular biological techniques so as to 
improve their resistance to proteolytic degradation or to optimise solubility properties or to 
render them more suitable as an immxmogenic agent. 
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3, Polynucleotides of the invention 

3.1 Polynucleotides encoding tiolypeptides of the invention 

3.1.2 Albicidin synthetase-encoding polynucleotides 

The invention further provides a polynucleotide that encodes a PKS-NRPS 
5 polypeptide of the invention, or biologically active fragment thereof, or /ariant or 
derivative of these as defmed above. In one embodiment, the polynucleotide comprises the 
entire sequence of nucleotides set forth in SEQ K) NO: 1. SEQ ID NO: 1 corresponds to a 
16511-bp albilineans xabB cistron. SEQ ED NO: 3, defines the full-length coding 
sequence of xabB and encodes various sequence signature motifs at the following 
10 nucleotide positions: 

(a) acyl-CoA ligase (AL) motif I from about nucleotide 676 to about nucleotide 720, 
and motif n from about nucleotide 1456 to about nucleotide 1477; 

(b) i3-ketoacyl synthase 1 (KSl) motif I from about nucleotide 2689 to about 
nucleotide 2739, motif H from about nucleotide 3112 to about nucleotide 3141, and 

1 5 motif EI from about nucleotide 3238 to about nucleotide 3267; 

(c) i3-ketoacyl synthase 2 (KS2) motif I from about nucleotide 8329 to about 
nucleotide 8379, motif II from about nucleotide 8752 to about nucleotide ^781, and 
motif EI from about nucleotide 8863 to about nucleotide 8892; 

(d) jS-ketoacyl reductase (KR) motif from about nucleotide 5434 to about nucleotide 
20 5526; 

(e) acyl carrier protein 1 (ACPI) motif from about nucleotide 1999 to about 
nucleotide 2034; 

(f) acyl carrier protein 2 (ACP2) motif from about nucleotide 7450 to about 
nucleotide 7485; 

25 (g) acyl carrier protein 3 (ACP3) motif from about nucleotide 7702 to about 

nucleotide 7735; 

(h) adenylation domain (A) motif I from about nucleotide 11416 to about nucleotide 
11433, motif n from about nucleotide 11551 to about nucleotide 11583, motif EI from 
about nucleotide 11749 to about nucleotide 11796; motif IV from about aucleotide 
30 11899 to about nucleotide 11910, motif V from about nucleotide 12187 to about 
nucleotide 12207, motif VI from about nucleotide 12340 to about nucleotide 12384, 
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motif Vn from about nucleotide 12454 to about nucleotide 12471, motif Vm from 
about nucleotide 12508 to about nucleotide 12567, motif DC from about nucleotide 
12715 to about nucleotide 12735, and motif X from about nucleotide 127 /5 to about 
nucleotide 12792; 

5 (i) peptidyl earner protein 1 (PCPl) motif from about nucleotide 9781 to about 

nucleotide 9813; 

0) peptidyl carrier protein 2 (PCP2) motif from about nucleotide 12916 to about 
nucleotide 12948; 

(k) condensation domain 1 (CI) motif I from about nucleotide 9997 to about 
10 nucleotide 10026, motif H from about nucleotide 10141 to about nucleotide 10167, and 
motif m from about nucleotide 10366 to about nucleotide 10395, motif IV from about 
nucleotide 10483 to about nucleotide 10503, motif V from about nucleotide 10816 to 
about nucleotide 10851, motif VI from about nucleotide 10921 to about nucleotide 
10941, motif vn from about nucleotide 10972 to about nucleotide 10995; and 
15 (1) condensation domain 2 (C2) motif I from about nucleotide 1312C to about 

nucleotide 13149, motif H from about nucleotide 13261 to about nucleotide 13287, and 
motif m from about nucleotide 13492 to about nucleotide 13521, motif IV from about 
nucleotide 13612 to about nucleotide 13632, motif V from about nucleotide 13945 to 
about nucleotide 13977, motif VI from about nucleotide 14053 to about nucleotide 
20 14073, motif vn from about nucleotide 14101 to about nucleotide 14124. 

Those of skill in tiie art wiU recognise tiiat, due to tiie degenerate nature of the 
genetic code, a variety of polynucleotides differing in their nucleotide sequences can be 
used to encode a given amino acid sequence of tiie invention. The native polynucleotide 
sequence encoding the PKS-NRPS of ^ albilineans is shown herein merely to illustrate a 
25 preferred embodiment of the invention, and the invention includes polynucleotides of any 
sequence that encode tiie amino acid sequences of tiie polypeptides and proteins of tiie 
inventioa 

3,1.2 PPTase-encoding polynucleotides 

The invention further provides a polynucleotide that encodes a PPTase 
30 polypeptide of tiie invention, or biologically active fragment tiiereof, or variant or 
derivative of these as defined above. Li one embodiment, tiie polynucleotide comprises the 
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entire sequence of nucleotides set forth in SEQ ID NO: 82. SEQ ID NO: 82 corresponds to 
a 1200-bp X, albilineans xabA cistron. This sequence encodes a PPTase catalytic domain 
from about nucleotide 475 to about nucleotide 654. This domain comprises two conserved 
PPTase sequence motifs: (T) motif I encoded by a nucleotide sequence from about 
5 nucleotide 475 to about nucleotide 501; and (II) motif II encoded by a nucleotide sequence 
from about nucleotide 619 to about nucleotide 654, of SEQ ID NO: 82. The intervening 
amino acid sequence, linking motifs 1 and II, is encoded by a nucleotide sequence from 
about nucleotide 502 to about nucleotide 618 of SEQ ID NO: 82. The said nucleotide 
sequences are presented for convenience in SEQ ID NO: 86, 88, 92 and 90, respectively. 
10 Suitably, the polynucleotide comprises the sequence set forth in SEQ ID NO: 84, which 
defines the full-length coding sequence of xabA. Alternatively, the polynucleotide 
comprises a contiguous sequence of nucleotides contained within the sequence set forth in 
SEQ ID NO: 86, which encodes the PPTase catalytic domain, 

3.13 Methyltransferase-encoding polynucleotides 

15 The invention fiirther provides a polynucleotide that encodes a methyltransferase 

polypeptide of the invention, or biologically active fragment thereof, or variant or 
derivative of these as defined above. In one embodiment, the polynucleotide comprises the 
entire sequence of nucleotides set forth m SEQ ID NO: 94. SEQ ID NO: 94 corresponds to 
a 1515-bp X, albilineans xabC cistron. This sequence encodes three conserved 

20 methyltransferase sequence motifs: (T) motif I encoded by a nucleotide sequence from 
about nucleotide 565 to about nucleotide 585; (II) motif II encoded by a nucleotide 
sequence from about nucleotide 741 to about nucleotide 774; and (HI) motif m encoded by 
a nucleotide sequence from about nucleotide 841 to about nucleotide 867, or SEQ ID NO: 
94. The said nucleotide sequences are presented for convenience in SEQ ID NO: 98, 100 

25 and 102, respectively. Suitably, the polynucleotide comprises the sequence set forth in 
SEQ ID NO: 96, which defines the full-length coding sequence of xabC Alternatively, the 
polynucleotide comprises a contiguous sequence of nucleotides contained within the 
sequence set forth in SEQ ID NO: 104 or 106, which encode biologically active fragments 
as described in Section 2.2. 
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3.2 Polynucleotide variants 

In general, polynucleotide variants according to the invention comprise regions 
that show at least 60%, more suitably at least 70%, preferably at least 80%, and more 
preferably at least 90% sequence identity over a reference polynucleotide sequence of 
5 identical size Ccomparison window*') or when compared to an aUgned sequence in which 
the alignment is performed by a computer homology program known in the art. What 
constitutes suitable variants may be determined by conventional techniques. For example, 
a polynucleotide comprising at least one sequence selected from the group consisting of 
SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 

10 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 82, 84, 86, 88, 90, 92, 
94, 96, 98, 100, 102 and 104 can be altered using any suitable method including 
conventional recombinant techniques and mutagenesis methods such as random 
mutagenesis (e.g., transposon mutagenesis), oligonucleotide-mediated (or site-directed) 
mutagenesis, PGR mutagenesis and cassette mutagenesis of an earUer prepared variant or 

1 5 non- variant version of an isolated polynucleotide of the invention. 

Alternatively, polynucleotide sequences variants encoding heterologous 
PKS/NRPS enzymes for producing PKS-NRPS variants of the invention may bb obtained 
from other secondary metabolite- or polyketide-producing organisms. For example, such 
variants may be prepared according to the following procedure: 
20 (a) creating primers which are optionally degenerate wherein each comprises a 

portion of a reference polynucleotide encoding a reference polypeptide or Iragment of 
the invention, preferably encoding at least one sequence selected from the group 
consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 83, 
25 87, 89, 91, 93, 95, 99, 101, 103, 105 and 107; 

(b) obtaining a nucleic acid extract from a secondary metabolite-producing 
organism, which is preferably a bacterium, more preferably from a species of the family 
Pseudomonadaceaej more preferably from ^Xanthomonas species; and 

(c) using said primers to amplify, via nucleic acid ampUfication teciiniques, at 
30 least one ampUfication product from said nucleic acid extract, wherein said 

amplification product corresponds to a polynucleotide variant. 
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Suitable nucleic acid amplification techniques are well known to the skilled 
addressee, and include polymerase chain reaction (PGR) as for example described in 
Ausubel et al (supra); strand displacement amplification (SDA) as for example described 
in U.S. Patent No 5,422,252; rolling circle replication (RCR) as for example described in 
5 Liu et al, (1996, 7. Am. Chem, Soc. 118:1587-1594 and Intemational application WO 
92/01813) and Lizardi et al, (Intemational Application WO 97/19193); nucleic acid 
sequence-based amplification (NASBA) as for example described by Sooknanan et al, 
(1994, Biotechniques 17:1077-1080); and Q-jS replicase amplification as for example 
described by Tyagi et al, (1996, Proc, Natl Acad. Scl USA 93: 5395-5400). 

10 Typically, polynucleotide variants that are substantially complementary to a 

reference polynucleotide are identified by blotting techniques that include a step whereby 
nucleic acids are immobilised on a matrix (preferably a synthetic membrane such as 
nitrocellulose), followed by a hybridisation step, and a detection step. Southern blotting is 
used to identify a complementary DNA sequence; northern blotting is used to identify a 

15 complementary RNA sequence. Dot blotting and slot blotting can be used to identify 
complementary DNA/DNA, DNA/KNA or RNA/RNA polynucleotide sequences. Such 
techniques are well known by those skilled in the art, and have been described in Ausubel 
et al (1994-1998, supra) at pages 2.9.1 through 2.9.20. 

According to such methods, Southern blotting involves separating DNA 
20 molecules according to size by gel electrophoresis, transferring the size-separated DNA to 
a synthetic membrane, and hybridising the membrane-bound DNA to a complementary 
nucleotide sequence labelled radioactively, enzymatically or fluorochromatically. In dot 
blotting and slot blotting, DNA samples are directly applied to a synthetic membrane prior 
to hybridisation as above. An alternative blotting step is used when identifying 
25 complementary polynucleotides in a cDNA or genomic DNA library, such as through the 
process of plaque or colony hybridisation. A typical example of this procedure i?^ described 
in Sambrook et al ("Molecular Cloiiing. A Laboratory Manual", Cold Spring Harbour 
Press, 1989) Chapters 8-12. 

Typically, the following general procedure can be used to determine hybridisation 
30 conditions. Polynucleotides are blotted/transferred to a synthetic membrane, as described 
above. A reference polynucleotide such as a polynucleotide of the invention is labelled as 
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described above, and the abUity of this labeUed polynucleotide to hybridise with an 
immobiUsed polynucleotide is analysed. A skilled addressee will recognise that a number 
of factors influence hybridisation. The specific activity of radioactively labelled 
polynucleotide sequence should typically be greater than or equal to about 10« dpm/mg to 

5 provide a detectable signal. A radiolabelled nucleotide sequence of specific activity 10* to 
10" dpm/mg can detect approximately 0.5 pg of DNA. It is well known m the art that 
sufficient DNA must be immobihsed on the membrane to pennit detection. It is desirable 
to have excess immobiUsed DNA, usually 10 Hg. Adding an inert polymer such as 10% 
(w/v) dextran sulfate (MW 500,000) or polyethylene glycol 6000 during hybridisation can 

10 also increase the sensitivity of hybridisation (see Ausubel supra at 2.10.10). 

To achieve meaningful results from hybridisation between a polynucleotide 
immobilised on a membrane and a labelled polynucleotide, a sufficient amount of the 
labelled polynucleotide must be hybridised to the immobifised polynucleotide following 
washmg. Washing ensures that the labelled polynucleotide is hybridised only to the 

15 immobiUsed polynucleotide with a desired degree of complementarity to tl.e labelled 
polynucleotide. It will be understood that polynucleotide variants according to the 
invention will hybridise to a reference polynucleotide under at least low stringency 
conditions. Reference herem to low stringency conditions include and encompass from at 
least about 1% v/v to at least about 15% v/v formamide and from at least aboui 1 M to at 

20 least about 2 M salt for hybridisation at 42" C, and at least about 1 M to at least about 2 M 
salt for washing at 42° C. Low stringency conditions also may include 1% Bo-me Serum 
Albumin (BSA), 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 7% SDS for hybridisation at 
65° C, and (i) 2xSSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHPO* (pH 
7.2), 5% SDS for washing at room temperature. 

25 Suitably, the polynucleotide variants hybridise to a reference polynucleotide under 

at least medium stringency conditions. Medium stringency conditions include and 
encompass from at least about 16% v/v to at least about 30% v/v formamide and from at 
least about 0.5 M to at least about 0.9 M salt for hybridisation at 42° C, and at least about 
0.1 M to at least about 0.2 M salt for washing at 55° C. Medium stringency conditions also 

30 may include 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 
7% SDS for hybridisation at 65° C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM 
EDTA, 40 mM NaHP04 (pH 7.2), 5% SDS for washing at 60-65° C. 
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Preferably, the polynucleotide variants hybridise to a reference polynucleotide 
under high stringency conditions. High stringency conditions include and encompass from 
at least about 31% v/v to at least about 50% v/v formamide and from about 0.01 M to 
about 0.15 M salt for hybridisation at 42** C, and about 0.01 M to about 0.02 M salt for 
5 washing at 55° C. High stringency conditions also may include 1% BSA, 1 mM EDTA, 0.5 
M NaHP04 (pH 7.2), 7% SDS for hybridisation at 65° C, and (i) 0.2 x SSC, 0.1% SDS; or 
(ii) 0.5% BSA, ImM EDTA, 40 mM NaHP04 (pH 7.2), 1% SDS for washing at a 
temperature in excess of 65° C. 

Other stringent conditions are well known in the art. A skilled addressee will 
10 recognise that various factors can be manipulated to optimise the specificity of the 
hybridisation. Optimisation of the stringency of the final washes can serve to ensure a high 
degree of hybridisation. For detailed examples, see Ausubel et al, supra at pages 2.10.1 to 
2.10.16 and Sambrook et al (1989, supra) at sections 1.101 to 1.104. 

While stringent washes are typically carried out at temperatures from about 42° C 
15 to 68° C, one skilled in the art will appreciate that other temperatures may be suitable for 
stringent conditions. Maximum hybridisation rate typically occiurs at about 20° C to 25° C 
below the Tm for formation of a DNA-DNA hybrid. It is well known in the art that the Tm 
is the melting temperature, or temperature at which two complementary polynucleotide 
sequences dissociate. Methods for estimating Tm are well known in the art (see Ausubel et 
20 al, supra at page 2.10.8). 

In general, the Tm of a perfectly matched duplex of DNA may be predicted as an 
approximation by the formula: 

Tm = 8 1.5 + 16.6 (logio M) + 0.41 (%G+C) - 0.63 (% formamide) - (600/length) 

wherein: M is the concentration of Na"*", preferably in the range of 0.01 molar to 
25 0.4 molar; %G-i-C is the sum of guanosine and cytosine bases as a percentage of the total 
number of bases, within the range between 30% and 75% G+C; % formamide is the 
percent formamide concentration by volume; length is the number of base pairs in the 
DNA duplex. 
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The Tra of a duplex DNA decreases by approximately T C with every increase of 
1% in the number of randomly mismatched base pairs. Washing is generally carried out at 
Tm - 1 5° C for high stringency, or Tm - 30° C for moderate stringency. 

In a preferred hybridisation procedure, a membrane (e.g.; a niu-ocellulose 
5 membrane or a nylon membrane) containing immobiUsed DNA is hybridised overnight at 
42° C in a hybridisation buffer (50% deionised formamide, 5xSSC, 5x Denhardi's solution 
(0.1% ficoU, 0.1% polyvinylpyrolUdone and 0.1% bovine serum albumin), 0.1% SDS and 
200 mg/mL denatured sahnon sperm DNA) containing labelled probe. The membrane is 
then subjected to two sequential medium stringency washes {i.e., 2xSSC, 0.1% SDS for 15 
10 min at 45° C, followed by 2xSSC, 0.1% SDS for 15 min at 50° C), followed by two 
sequential higher stringency washes (/.e., 0.2xSSC, 0.1% SDS for 12 min at 55° C 
followed by 0.2xSSC and 0.1%SDS solution for 12 min at 65-68° C. 

Methods for detecting a labelled polynucleotide hybridised to an immobiUsed 
polynucleotide are well known to practitioners in the art. Such methods mclude 
15 autoradiography, phosphorimaging, and chemiluminescent, fluorescent and colorimetric 
detection. 

4, Expression vectors 

The present invention further provides expression vectors designed for genetic 
transformation of cells, preferably prokaryotic cells, comprising a polynucleotide, fragment 
20 or variant according to the invention operably linked to a regulatory polynucleotide. An 
expression vector is typically a nucleic acid that can be introduced into a host cell or cell- 
free transcription and translation system. An expression vector can be maintained 
permanently or transiently in a cell, whether as part of the chromosomal or other DNA in 
the cell or in any cellular compartment, such as a rephcating vector in the cjrtoplasm. 

25 The various components of an expression vector can vary widely, depending on 

the intended use of the vector and especially the host cell(s) in which the vector is intended 
to replicate or drive expression. For example, the regulatory polynucleotide, which is used 
to control expression of a polynucleotide of the invention, will generally be appropriate for 
the host cell used for expression. Numerous types of appropriate expression vectors and 

30 suitable regulatory sequences are known in the art for a variety of host cells. Typically, the 
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regulatory polynucleotide includes, but is not limited to. promoter sequences, leader or 
signal sequences, ribosomal binding sites, transcriptional start and stop sequences, 
translational start and temiination sequences, and enhancer or activator sequences. 
Constitutive or inducible promoters as known in the art are contemplated by the invention. 
5 The promoters may be either naturally occuning promoters, or hybrid promoters that 
combine elements of more than one promoter. 

In a preferred embodiment, the expression vector is operable in a Gram-negative 
prokaryotic cell. A variety of prokaryotic expression vectors, which may be used as a basis 
for constructing the expression vector of the invention. These include but are not limited to 
10 a chromosomal vector (eg., a bacteriophage such as bacteriophage X). an 
extrachromosomal vector {e.g., a plasmid or a cosmid expression vector). The expression 
vector wiU also typicaUy contain an origin of repUcation, which aUows autonomous 
replication of the vector, and one or more selectable marker genes that allow phenotypic 
selection of the transformed cells. 

1 5 The expression vector may also include a fusion partiier (typically provided by the 

expression vector) so that a recombinant polypeptide is expressed as a fusion polypeptide 
with said fusion partner. The main advantage of fusion partners is that they assist 
identification and/or purification of said fusion polypeptide. In order to express said fusion 
polypeptide, it is necessary to ligate a polynucleotide according to the invention into the 
20 expression vector so that the translational reading frames of the fiision partner and the 
polynucleotide coincide. Well known examples of fusion partners include, but are not 
limited to, glutathione-S-transferase (GST), Fc potion of human IgG. maltose binding 
protein (MBP) and hexahistidine (BOSs). which are particularly useful for isolation of the 
fusion polypeptide by affinity chromatography. For the purposes of fiision polypeptide 
25 purification by affinity chromatography, relevant matrices for affinity chromatography are 
glutathione-, amylose-, and nickel- or cobalt-conjugated resins respectively. Many such 
matrices are available in "kit" form, such as the QIAexpress™ system (Qiagen) usefiil with 
(fflSe) fusion partners and the Pharmacia GST purification system, hi a prefenred 
embodiment, the recombinant polynucleotide is expressed in the commercial vector 
30 pFLAG as described more fiilly hereinafter. Another fiision partner well known in the art is 
green fluorescent protein (GFP). This fiision partner serves as a fluorescent "tag" which 
allows the fiision polypeptide of the invention to be identified by fluorescence microscopy 
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or by flow cytometry. The GFP tag is useful when assessing subcellular localisation of the 
fusion polypeptide of the invention, or for isolating cells which express the fusion 
polypeptide of the mvention. Flow cytometric methods such as fluorescence activated cell 
sorting (FACS) are particularly useful in this latter application. Preferably, the fusion 

5 partners also have protease cleavage sites, such as for Factor Xa or Thrombin, v^hich allow 
the relevant protease to partially digest the fusion polypeptide of the invention and thereby 
liberate the recombinant polypeptide of the invention therejfrom. The liberated polypeptide 
can then be isolated from the fusion partner by subsequent chromatographic separation. 
Fusion partners according to the invention also include within their scope "eprtope tags", 

10 which are usually short peptide sequences for which a specific antibody is available. Well 
known examples of epitope tags for which specific monoclonal antibodies are readily 
available include c-Myc, influenza virus, haemagglutinin and FLAG tags. 

Preferred host cells for purposes of selecting vector components for expression 
vectors of the present invention include fungal host cells such as yeast and prokaryotic host 
15 cells such as E, coli and X, albilineans, but mammalian cell cxiltures can also be used. In 
hosts such as yeasts, plants, or mammalian cells that ordinarily do not produce modular 
polyketide synthase enzymes, it may be necessary to provide, also typically by 
recombinant means, suitable holo-ACP synthases to convert the recombinantly produced 
PELS to functionality. 

20 The expression vector may be used to transform the desired host cell to produce a 

recombinant host cell for producing inter alia a recombinant polypeptide or polyketides, 
particularly albicidins or analogues thereof, as described hereinafter. 

5. Methods of preparing the polypeptides of the invention 

Polypeptides of the inventions, including the full-length parent polypeptides 
25 described m Section 2.1, or their biologically active firagments comprising, for example 
one or more domains (or fragments of such domains), or variants or derivatives of these, 
may be prepared by any suitable procedure known to those of skill in the art. For example, 
the polypeptides may be prepared by a procedure including the steps of: - 

(a) preparing a recombinant polynucleotide comprising a nucleotide sequence 
30 encoding a polypeptide comprising the sequence set forth in any one of SEQ ID NO: 4 
or a fragment thereof comprising at least one sequence selected from the group 
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consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 83, 
87, 89, 91, 93, 95, 99, 101, 103, 105 and 107, or variant or derivative of these, which 
nucleotide sequence is operably linked to a regulatory polynucleotide; 
5 (b) introducing the recombinant polynucleotide into a suitable host cell; 

(c) culturing the host cell to express recombinant polypeptide from said 
recombinant polynucleotide; and 

(d) isolating the recombinant polypeptide. 

Suitably, said nucleotide sequence comprises at least one sequence selected from 
10 the group consisting of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 
82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102 and 104. 

The recombinant polynucleotide is preferably in the form of an expression vector, 
which includes a self-replicating extra-chromosomal vector such as a plasmid, or a vector 
15 that integrates into a host genome, as for example described above in Section 4. The step of 
introducing the recombinant polynucleotide into the host cell may be effected by any 
suitable means including transfection, and transformation, the choice of which will be 
dependent on the host cell employed. Such methods are well known to those of skill in the 
art. 

20 Recombinant polypeptides of the invention may be produced by cultxuing a host 

cell transformed with an expression vector containing nucleic acid encoding a polypeptide, 
biologically active fragment, variant or derivative according to the invention. The 
conditions appropriate for protein expression will vary with the choice of expression vector 
and the host cell. This is easily ascertained by one skilled in the art through routine 

25 experimentation. 

Suitable host cells for expression may be prokaryotic or eukaryotic. One preferred 
host cell for expression of a polypeptide according to the invention is a bacterium. The 
bacterium used may be Escherichia coli. Alternatively, the host cell may be an insect cell 
such as, for example, SF9 cells that may be utilised with a baculovirus expression system. 
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The recombinant protein may be conveniently prepared by a person skilled in the 
art using standard protocols as for example described in Sambrook, et al.y MOLECULAR 
CLONING. A LABORATORY MANUAL (Cold Spring Harbor Press, 1989), in particular 
Sections 16 and 17; Ausubel et al, CURRENT PROTOCOLS IN MOx^ECULAR 
5 BIOLOGY (John Wiley & Sons, Inc. 1994-1998). in particular Chapters 10 and 16; and 
CoUgan et aL, CURRENT PROTOCOLS IN PROTEIN SCIENCE (John Wiley & Sons, 
Inc. 1995-1997), in particular Chapters 1, 5 and 6. 

Alternatively, the polypeptide, fragments, variants or derivatives of the invention 
may be synthesised using solution synthesis or solid phase synthesis as described, for 
10 example, in Chapter 9 of Atherton and Shephard (supra) and m Roberge et al (1995, 
Science 269: 202), 

6» A n tigen-binding molecules 

The invention also contemplates antigen-binding molecules that bind specifically 
to the aforementioned polypeptides, fragments, variants and derivatives. Preferably, an 
1 5 antigen-binding molecule according to the invention is immuno-interactive with any one or 
more of the amiuo acid sequences set forth in SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 
70, 72, 74, 76, 78, 80, 83, 87, 89, 91, 93, 95, 99, 101, 103, 105 and 107, or variants thereof. 

For example, the antigen-binding molecules may comprise whole polyclonal 
20 antibodies. Such antibodies may be prepared, for example, by mjecting a polypeptide, 
fragment, variant or derivative of the invention into a production species, which may 
include mice or rabbits, to obtain polyclonal antisera. Methods of producing polyclonal 
antibodies are well known to those skilled in the art. Exemplary protocols which may be 
used are described for example in Coligan et al, CURRENT PROTOCOLS IN 
25 IMMUNOLOGY, (John Wiley & Sons, Inc, 1991), and Ausubel et al, (1994-1998, supra\ 
in particular Section HI of Chapter 11. 

In lieu of the polyclonal antisera obtained in the production species, monoclonal 
antibodies may be produced using the standard method as described, for example, by 
Kohler and Milstein (1975, Nature 256, 495-497), or by more recent modifications thereof 
30 as described, for example, in Coligan et al, (1991, supra) by immortalising spleen or other 
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antibody producing cells derived from a production species which has been inoculated with 
one or more of the polypeptides, fragments, variants or derivatives of the invention. 

The invention also contemplates as antigen-binding molecules Fv, Fab, Fab' and 
F(ab')2 immunoglobulin fragments. Alternatively, the antigen-binding molecule may be in 

5 the form of a synthetic stabilised Fv (scFv) fragment, a disulphide stabilised Fv (dsFv) 
fragment, a diabody (dAb), a minibody and the like, or may comprise non-immunoglobulin 
derived, protein frameworks. The antigen-binding molecules of the invention may be used 
for affinity chromatography in isolating a natural or recombinant polypeptide or 
biologically active fragment of the invention. For example reference may be made to 

10 immunoaffinity chromatographic procedures described in Chapter 9.5 of Coligan et ai, 
(1995-1997, supra). The antigen-binding molecules can be used to screen expression 
Ubraries for variant polypeptides of the invention as described herein. They can also be 
used to detect polypeptides, fragments, variants and derivatives of the invention as 
described hereinafter. 

15 7. Identification of modulators 

The invention also contemplates a method of screening for an agent that 
modulates the expression of a gene .selected from xabB, xabA, or xabC, or a gene 
belonging to the same regulatory or biosynthetic pathway as xabB, xabA, or xabC, or a 
variant of that gene, or that modulates the level and/or functional activity of an expression 

20 product of that gene or its variant. The method comprises contacting a preparation 
comprising said expression product (e.g., polypeptide or transcript), or a biologically active 
fragment thereof, or variant or derivative of these, or a genetic sequence that ml dulates the 
expression of said gene (e.g., the natural promoter relating to said gene, e.g., the xabB 
promoter, comprising the sequence set forth in SEQ ID NO: 81 or complement thereof), 

25 with a test agent, and detecting a change in the level and/or functional activity of said 
polypeptide or biologically active fragment thereof, or variant or derivative, or cf a product 
expressed from said genetic sequence. 

Modulators contemplated by the present invention includes agonists and 
antagonists of gene expression include antisense molecules, ribozymes and co-suppression 
30 molecules, as for example described in Section 2. Agonists include molecules which 
increase promoter activity or interfere with negative mechanisms. Agonists of a gene 
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include molecules which overcome any negative regulatory mechanism. Antagonists of 
polypeptides encoded by a gene of interest include antibodies and inhibitor peptide 
fragments. 

Candidate agents encompass numerous chemical classes, though typically they are 
5 organic molecules, preferably small organic compounds having a molecular weight of 
more than 50 and less than about 2,500 Dalton, Candidate agents comprise functional 
groups necessary for structural interaction with proteins, particularly hydrogen bonding, 
and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at 
least two of the functional chemical groups. The candidate agents often comprise cyclical 
10 carbon or heterocyclic structures and/or aromatic or polyaromatic structures substituted 
with one or more of the above functional groups. Candidate agents are also found among 
biomolecules including, but not limited to: peptides, saccharides, fatty acids, steroids, 
purines, pyrimidines, derivatives, structural analogues or combinations thereof. 

Small (non-peptide) molecule modulators of a polypeptide according to the 
1 5 invention, or portion, or domain or module thereof are particularly preferred. In diis regard, 
small organic molecules typically have the ability to gain entry into an appropriate cell and 
affect the expression of a gene (e.g., by interacting with the regulatory region or 
transcription factors involved in gene expression); or affect the activity of a gene by 
inhibiting or enhancing the binding of accessory molecules. 

20 Alternatively, Ubraries of natural compounds in the form of bacterial, fungal, plant 

and animal extracts are available or readily produced. Additionally, natural or synthetically 
produced libraries and compounds are readily modified through conventional chemical, 
physical and biochemical means, and may be used to produce combinatorial libraries. 
Known pharmacological agents may be subjected to directed or random chemical 

25 modifications, such as acylation, allcylation, esterification, amidification, etc. to produce 
structural analogues. Screening may also be directed to known pharmacologically active 
compounds and chemical analogues thereof. 

Screening for modulatory agents according to the invention can be achieved by 
any suitable method. For example, the method may include contacting a cell comprising a 
30 polynucleotide corresponding to a gene as defined above, with an agent suspected of 
having said modulatory activity and screenmg for the modulation of the level and/or 
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functional activity of a protein encoded by said polynucleotide, or the modulation of the 
level of an expression product encoded by the polynucleotide, or the modulation of the 
activity or expression of a downstream cellular target of said protein or said expression 
product. Detecting such modulation can be achieved utilising techniques including, but not 
5 restricted to, ELISA, cell-based ELIS A, filter-binding ELIS A, inhibition BUS A, Western 
blots, irmnunoprecipitation, slot or dot blot assays, immunostaining, RIA, scintillation 
proximity assays, fluorescent immunoassays using antigen-binding molecule conjugates or 
antigen conjugates of fluorescent substances such as fluorescem or rhodamine, 
Ouchterlony double diffusion analysis, immunoassays employing an avidin-biotin or a 
10 streptavidin-biotin detection system, and nucleic acid detection assays including reverse 
transcriptase polymerase chain reaction (RT-PCR). 

It will be understood that a polynucleotide firom which a target molecule of 
interest is regulated or expressed may be naturally occurring in the cell which is the subject 
of testing or it may have been introduced into the host cell for the purpose of testing. 

15 Further, the naturally-occurring or introduced sequence may be constitutively expressed - 
thereby providing a model useful in screening for agents which down-regulate expression 
of an encoded product of the sequence wherein said down regulation can be at the nucleic 
acid or expression product level - or may require activation - thereby providing a model 
useful in screening for agents that up-regulate expression of an encoded product of the 

20 sequence. Further, to the extent that a polynucleotide is introduced into a cell, that 
polynucleotide may comprise the entire coding sequence which codes for a target 
polypeptide or it may comprise a portion of that coding sequence (e.g. a domain or module 
as herein described) or a portion that regulates expression of a product encoded by the 
polynucleotide (eg, a promoter). For example, the promoter that is naturally associated 

25 with the polynucleotide {ie. the xabB promoter) may be mtroduced into the cell that is the 
subject of testing. Li this regard, where only the promoter is utiUsed, detecting modulation 
of the promoter activity can be achieved, for example, by operably linking the promoter to 
a suitable reporter polynucleotide including, but not restricted to, green fluorescent protein 
(GFP), luciferase, B-galactosidase and catecholamine acetyl transferase (CAT). Modulation 

30 of expression may be determined by measuring the activity associated with the reporter 
polynucleotide. 
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In another example, the subject of detection could be a downstream regulatory or 
biosynthetic target of the target molecule, rather than target molecule itself or the reporter 
molecule operably linked to a promoter of a gene encoding a product the expression of 
which is regulated by the target protein. 

5 These methods provide a mechanism for performing high throughput screening of 

putative modulatory agents such as proteinaceous or non-proteinaceous agents comprising 
synthetic, combinatorial, chemical and natural libraries. These methods will also facilitate 
the detection of agents which bind either the polynucleotide encoding the target molecule 
or which modulate the expression of an upstream molecule, which subsequently modulates 
10 the expression of the polynucleotide encoding the target molecule. Accordingly, these 
methods provide a mechanism of detecting agents that either directly or indirectly 
modulate the expression and/or activity of a gene or expression product according to the 
invention. 

8. Production of secondary metabolites 

15 The present invention fiirther relates to a process for enhancing the kvel and/or 

functional activity of secondary metabolites, preferably albicidins, using one or more 
agents selected from the polynucleotides, polypeptides, fragments, variants, derivatives, 
vectors and modulatory agents described above. The process in a prefenred embodiment, 
includes the steps of stably transforming a host cell with an expression vector as broadly 

20 described above, comprising at least one nucleic acid sequence encoding a polypeptide of 
the invention or a biologically active fragment or variant or derivative of these and 
isolating transformants which produce an enhanced amount of antibiotics, which are 
preferably of the albicidin class. The vector optionally comprises a signal sequence for 
secretion recognised by the host cell. Illustrative secretory leaders include the secretory 

25 leaders of penicillinase, a-factor, immunoglobulin, T-cell receptors, outer membrane 
proteins, glucoamylase, ftmgal amylase and the like. By ftision in proper reading frame, the 
mature polypeptide may be secreted into the medium. The host cell may be a eukaryote or 
a prokaryote cell. In one embodiment, the cell naturally produces polyketides, preferably 
antibiotic polyketides and, in this regard, the cell is preferably X, albilineans or other 

30 bacteria capable of producing albicidins. Optionally, the construct may include a 
transcription regulating sequence, which is not subject to repression by substances present 
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in the growth medium. The above process may be used to prepare antibiotics directly or 
they may be used to prepare cell free extracts containing increased quantities of antibiotics, 
preferably of the albicidin class, for in vitro preparation of said antibiotics. Siiitably, these 
cell free extracts may be prepared for example using the method disclosed by Dobrogosz, 
5 WJ. (1981) Enzymatic activity. In Manual of Methods for General Bacteriology 
(Gerhardt, P., ed) Washington, DC: American Society for Microbiology, pp. 365-392. In a 
preferred embodiment, a vector from which a phosphopantethemyl transferase (PPTase) 
can be translated is also introduced into the host cell. Expression of PPTase 
polynucleotides has been shown to be important for the production of polyketides in 
10 heterologous expression systems. Preferably, the PPTase is selected from EntD and/or 
XabA as for example disclosed herein. If desired, a vector from which a methyltransferase, 
more preferably and 0-methyltransferase, and even more preferablv an S- 
adenosyhnethionine O-methyltransferase can be translated may also be introduced into the 
host cell. An exemplary methyltransferase for this purpose is XabC as described herein. 

15 Alternatively, the expression hosts may be used as a source of increased quantities 

of antibiotics, which can be subsequently purified as for example disclosed by Birch ei al. 
in U.S. Patent No. 4,525,354. 

The invention also contemplates use of the polynucleotides, polypeptides, 
fragments, variant and derivatives of the invention in methods of combinatorial 

20 biosynthesis of novel antibiotics as for example disclosed by Khosla et al in U.S. Patent 
No. 5,712,146, Peterson et al in U.S. Patent No. 5,783,431 and Betlach et al m U.S. 
Patent No. 6,251,636 or in methods of producing antibiotics in hosts that ordinarily do not 
produce them as for example disclosed by Barr et al in U.S. Patent No. 6,033,883. As 
discussed in Section 2.4, the invention contemplates albicidin PKS-NRPS derivatives with 

25 altered activities in one or more respects for the production of polyketides other than the 
albicidin natiual product(s) of the XabB. In this regard, expression vectors containing 
nucleotide sequences encoding a variety of such derivatives for the production of different 
polyketides are transformed into the appropriate host cells to construct a library. In one 
embodiment, a mixture of such vectors is transformed into selected host cells and the 

30 resulting cells plated into individual colonies and selected to identify successful 
transformants. A variety of strategies is available to obtain a multiplicity of colonies each 
containing a PKS gene cluster derived from the naturally occurring host gene cluster so 
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that each colony in the library produces a different PKS and ultimately a different 
polyketide, as for example disclosed by Betlach et al in U.S. Patent No. 6,251,636. The 
libraries thus produced can be considered at four levels: (1) a multipUcity of colonies each 
with a different PKS-NRPS encoding sequence; (2) the proteins produced from the coding 

5 sequences; (3) the polj^etides produced from the proteins assembled into a functional 
PKS-NRPS; and (4) antibiotics or compounds with other desired activities derived from 
the polyketides. Colonies in the library can be induced to produce the relevant synthases 
and thus to produce the relevant polyketides to obtain a library of polyketides. Polyketides 
that are secreted into the media or have been otherwise isolated can be screened for 

10 binding to desired targets, such as receptors, signalling proteins, and the like. The 
supematants per se can be used for screening, or partial or complete purificrtion of the 
polyketides can first be effected. Typically, such screening methods involve detecting the 
binding of each member of the Ubrary to receptor or other target ligand. Binding can be 
detected either directly or through a competition assay Means to screen such Ubraries for 

15 binding are well known in the art. Alternatively, individual polyketide members of the 
library can be tested against a desired target. In this event, screens wherein the biological 
response of the target is measured can more readily be included. Antibiotic activity can be 
verified using typical screening assays such as those for albicidin set forth in Example 1. 

The invention also extends to the use of the polynucleotides, polypeptides, 
20 fragments, variant and derivatives of the invention for the synthesis of antibiotics, 
preferably antibiotics of the albicidin class. 

The polynucleotides of the invention encoding XabB, or a biologically-active 
fragment or variant thereof, together with a recombinant polynucleotide encoding a PPTase 
and/or an 0-methyltransferase which participate or which are capable of participating in 
25 the albicidin biosynthetic pathway, provide the means to engineer high level co-expression 
of the albicidin synthetase, its activating PPTase and modifying methyltransferase to obtain 
higher yields of albicidins. 

In order that the invention may be readily understood and put into practical effect, 
particular preferred embodiments wiU now be described by way of the following non- 
30 limiting examples. 
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EXAMPLES 
EXAMPLE 1 

Albicidin multifunctional synthase sene 

Materials and Methods 

5 Bacterial strains and plasmids 

The properties of bacteria and plasmids used in this example are listed in Table 1. 

Media> culture conditions and antibiotics 

X albilineans strains were routinely cultured on SP medium (Birch & Patil, 
1985b) at 28'' C. Escherichia coli DH5a and JM109 were used as hosts in cloning 

10 experiments and were grown on LB medium at 37° C (Sambrook et al, 1989). Broth 
cultures were aerated by shaking at 200 r.p.m. on an orbital shaker. Modified YEB medium 
(Van Larebeke et aly 1977) for patch mating consisted of 10 mg ml"^ peptone, 5 mg mL'^ 
yeast extract, 5 mg mL'^ NaCl, 5 mg mL"^ sucrose and 0.5 mg mL'^ MgS04.7H20. The 
following antibiotics were added to media as required: 50 |ig kanamycin mL'^; 15 |ig 

1 5 tetracycline mLl'' ; 1 00 [xg ampicillin mL'^ . 



Routine genetic procedures 

Bacterial genomic DNA and plasmid DNA isolation, gel electrophoresis, DNA 
restriction digests, ligation reactions and transformation were performed by routine 
procedures (Sambrook et al, 1989). DNA fragments were excised firom agarose gels and 
20 residual agarose was removed with the BRES Aclean™ DNA purification kit (GeneWorks, 
Adelaide). 

Constmction of a X. albilineans partial genomic library 

Genomic DNA from X. albilineans XaI3 was digested with EcoKL and size- 
fractionated. DNA fragments of 15 to 20 kb were ligated to dephosphorylated EcoBl- 
25 cleaved pBluescript SK n. The ligated DNA was electroporated into E. coU TOPIO. 
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Transfonnaats were selected on LB agar medium containing ampicillin, and stored in LB 
broth with 15% glycerol at -70°C. 

PGR amplification 

5amHI-digested genomic DNA fix>m X, albilineans LSI 57 was religated at low 
5 concentration (0.5 |ig/niL) to generate circular DNA molecules as templates for inverse 
PGR. Three primers, one from, the IS terminal region of Tn5 (IR2: 5- 
GGGGATGGTGACATGGAAG TCAGATCCTG-3*), and two flanking the unique BamHI 
restriction site of Tn5 (BL: 5'-GGGGACCTTGCACAGATAGC-3', and BR: 5'- 
CATTCCTGTAGCGGATGGAGATC-3'), were used to amplify the sequences flanking 
10 the Tn5 insertion in the genome of LS157. The amplified fragments (1.4-kb and 6.0-kb) 
were cloned into pZErO-2, yielding pZIL and pZIR (Figure 1). 

PGR was performed m a volume of 50 |il with 200 ng of genomic DNA (or 10 ng 
of plasmid DNA), 0.4 ng/pL of each of primer, 0.2 mM of each dNTP, 1.8 mM Mg^"", and 
1 unit of elongase enzyme mix (Life Technologies). A 10-min initial denaturation step at 
15 94*^ C was followed by 35 thermal cycles of denaturation at 94*" C for 1 min, annealing at 
55° C for 1 min, and extension at IT" C for 1 min per 1 kb of expected amplification. 

Construction of promoter probes and glucuronidase assay 

Plasmid pRG960sd contains a promoterless j3-glucuronidase gene {uidA) 
downstream of a multiple cloning site (Van den Edde et al, 1992). Sequence upstream of 

20 xabB (nucleotide residues 1005 to 1210 or 521 to 1210) was amplified from pLXABB by 
PGR. Forward primer PlFl f5'-ACGC GGATCC CAGCAGGGTGTGATACAGG-3'), or 
P1F2 (5'-TCGCGGATCC_GCGCGATTGAAGTAGTCC-3») contained a BamHI 
restriction site (underlined). Reverse primer PIR (5- 
TCC CCCGGG CGGCCAGCGTGGTGCTACTAC-30 introduced a Xmal restriction site 

25 (underlined). PGR fragments were ligated into BamHUXmal-cai pRG960sd, yielding 
pRG960pl and pRG960p2. These constructs were mobilised from E. coli DH5a into X. 
albiUneans LSI 55 as described below. 



wo 02/24736 



PCT/AUOl/01190 



-77- 

Promoter strength was quantified by fluorometric analysis of glucuronidase 
activity (Jefferson, 1987; Xiao et al, 1992). The protem content in cell lysates was 
determined by the dye-binding method (Bradford, 1976) using a Bio-Rad protein assay kit. 

Bacterial conjugation 

5 DNA transfer between E, coli donor (JM109 pLAFR3 ± insert, or DH5a 

pRG960sd ± insert) and albilineans recipient (LS157 or LS155) was accomplished by 
triparental transconjugation with helper stram pRK2013. Mid-log-phase cultures of the 
recipient were spotted onto agar plates containing YEB medium with no antibiotics (20 
per spot). After the Uquid was absorbed by the agar, 20 |iL of mid-log-phase culture of the 

10 helper was added to each spot. The liquid was again allowed to absorb, and 20 fiL of mid- 
log-phase culture of the donor was added to each spot. After incubation of the mating 
plates overnight at 28'' C, transconjugants were selected on SP plates supplemented with 
ampicillin, and tetracycline or spectinomycin. 

Assay and quantification of albicidin production 

15 Albicidin was quantified by a microbial plate bioassay as described previously 

(Birch and Patil, 1985b), except that the 10 mL basal layer of LB agar and the 5mL 
overlayer of 50% LB with 1% agar were supplemented with tetracycUne or spectinomycin, 
and E. coli DH5a pLAFR3 or pRG960sd was used as the indicator strain. This change 
avoided interference by tetracycline or spectinomycin, which were added to some cultures 

20 to ensure retention of pLAFR3 or pRG950sd derivatives in X, albilineans. Inhibition zone 
widths in the bioassay were converted to albicidin concentrations by interpolation on a 
dose-response plot produced under the same assay conditions. The plot fits the formula: 
Log [Alb] = 0.3 W - 0.92, where [Alb] is units of albicidin per 20 |aL sample assayed, and 
W is the width in millimetres of the zone of grovrth inhibition surrounding each well. 

25 Results 

Cloning and sequencing oixabB gene required for albicidin production 

Xanthomonas albilineans Tox' mutant LSI 57 contains a single Tn5 insertion, in a 
4.1 kb Clal restriction fragment or a 16.5 kb Ecd91 restriction firagment (Figure 1). 
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S election for kanamycin resistance, following shotgun cloning of Clal restriction 
fragments of LSI 57 DNA into pBluescript II SK, yielded clone pBC157. Sequences 
flanking the Tn5 insertion in LSI 57 DNA were amplified by inverse PGR, and ■ :loned into 
pZErO-2, producing pZIL and pZIR. Plasmid pLXABB was screened from a X. 
5 albilineans Xal3 EcoRI genomic library with probes described in Figure IB. Subclones 
pSEBL and pSEBR were derived from pLXABB (Figure IC, Table 1). 

The double-strand sequence of thel6,511 bp EcoRI genomic fragment in 
pLXABB was obtained by a primer-walking approach, using subclones pBC157, pZIL, 
pzm, pSEBL, and pSEBR. The Tn5 insertion in the genome of LSI 57 is accompanied by 
10 9-bp perfect repeat sequence (GTCCTGAAG), commencing at 2490 bp in GenBank 
accession no. AF239749. 

The only ORF longer than 900 bp within the 1 6.5-kb fragment is disrupted by the 
Tn5 insertion. This ORF (designated xabB) encodes a protein of 4081 aa (Mr 525,695). It 
commences at 1230 bp in GenBank accession no. AF239749 with a TTG codon, 6 bp 

15 downstream from a ribosome binding sequence (BBS) GAGG, which may impose post- 
transcriptional control on the rate of gene product formation (McCarthy and Gualerzi, 
1990). There is an alternative start codon (ATG) a further 15 bp downstream. Of the 
codons in this ORF, 8.5% are rarely used in E. coli. The closest match (TTGAGC-14x- 
TATAAC) to the consensus -35 (TTGACA) and -10 (TATAAT) sequences for coli 

20 promoters occurs 1 17 bp upstream of the translation initiation codon (Figure 2). 

Downstream by 35 bp from the TAG stop codon of xabB is a probable RBS 
(GAGG), separated by 6 bp from the ATG start codon of another ORF (designated xabQ 
in the same orientation as xabB, Overlappmg the xabB promoter region is another probable 
promoter for a divergent transcript including a putative RBS (TGGAGG) and start codon 
25 for a gene designated xatA, separated by 233 bp from xabB (Figure 1, 2). 

Complementation of xabB gene in LSI 57 

MobiUsation of pLAFR3, pLXABBl or pLXABB2 by bacterial conjugation into 
Tox" mutant LS157 occurred at a frequency of 1.5 x 10*^ transconjugants/recipient cells. 
Albicidin production was undetectable in Tox" mutant LSI 57 and LSI 57 (pLAFR3) 
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controls, but introduction of the xabB gene on pLXABBl or pLXABB2 restored albicidin 
production to the level of the wild-type parental strain LSI 55 (Figure 4). 

Functional analysis of xabB promoter region 

GUS activity was undetectable in LS155 and LS155 (pRG960sd) controls. 
5 Plasmid pRG960pl or pRG960p2, with 206 bp or 690 bp from the xabB promoter region 
upstream of GUS, both conferred GUS activity with no difference in expression level or 
pattern in X. albilineans LS155 (Figure 5). 

Discussion 

Albicidin was partially characterised as a low-molecular-weight compound" that 
10 contains 38 carbon atoms with 3-4 aromatic rings (Birch and Patil, 1985a). The compound 
is not degraded by peptidases (Birch and Patil, 1985a), but it is cleaved by the AlbD 
esterase (Zhang and Birch, 1997). Based on the deduced functionality of the synthase 
describe herein, albicidin is likely to be a complex polyketide, condensed with amino 
acid(s), or nonproteinogenic amino, hydroxyl and carboxyl acid(s) by C-N, amide or ester 
15 bond formation. 

The characterisation of XabB as a multi-modular hybrid enzyme provides new 
insights into the mechanism of albicidin biosynthesis and possible approaches to engineer 
the overproduction of albicidins. For example, the complementation experiments (Figure 
4) indicate that increased copy number oixabB stimulates early production of albicidin, 

20 but other factors become limiting during idiophase. It may be possible to increase 
expression of the albicidin synthase by modifications to the promoter and TTG start codon, 
or to improve albicidin yields by supplying candidate substrates (such as shikimate-derived 
units). The unusual enzyme organisation also contributes to the emerging understanding of 
how microbes generate structural diversity of antibiotics, and can faciUtate combinatorial 

25 engineering of antibiotics of mixed peptide/polyketide origin. 
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EXAMPLE! 

Albicidin Antibiotic and Phvtotoxin Biosynthesis in Xanthomonas albilineans Requires a 
Phosvhovantetheinvl Transferase Gene 

Materials and Methods 

5 Bacterial strains and plasmids 

The properties of bacteria and plasmids used in this Example are listed in Table 3. 

Media, culture conditions and antibiotics 

X, albilineans strains were routinely cultured on SP medium (Birch & Patil, 
1985b) at 28° C. Escherichia coli DH5a and JM109 were used as hosts in cloning 

10 experiments and were grown on LB medium at 3T C (Sambrook et al, 1989). Broth 
cultures were aerated by shaking at 200 r.p.m. on an orbital shaker. Modified YEB medium 
(Van Larebeke et al, 1977) for patch mating consisted of 10 mg ml"^ peptone, 5 mg mL'^ 
yeast extract, 5 mg mL'^ NaCl, 5 mg mL"^ sucrose and 0.5 mg mL*^ MgS04.7H20. The 
following antibiotics were added to media as required: 50 ixg kanamycin mL"^; 15 ^ig 

1 5 tetracycline mLl"^ ; 1 00 |j,g ampicillin mL"^ . 

Assay of albicidin production 

Albicidin was quantified by a microbial plate bioassay as described previously 
(Birch and Patil, 1985b), except that the 10 mL basal layer of LB agar and the 5 mL 
overlayer of 50% LB with 1% agar were supplemented with tetracycline, and E. coli 
20 DH5a [pLAFR3] was used as the indicator strain. This change avoided interference by 
tetracycline, which was added to some cultures to ensure retention of pLAFR3 lerivatives 
in^ albilineans. 

Routine genetic procedures 

Bacterial genomic DNA and plasmid DNA isolation, gel electrophoresis, DNA 
25 restriction digests, hgation reactions and transformation were performed by routine 
procedures (Sambrook et ai, 1989). DNA firagments were excised fi-om agarose gels and 
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residual agarose was removed wth the BRESAclean™ DNA purification kit (GeneWorks, 
Adelaide). 

DNA sequencing and analysis 

Sequencing reactions were performed by dideoxynucleotide chain termination 
5 (Sanger et aL, 1977) using the BigDye™ Tenninator Cycle Sequencing Kit and 373A 
DNA sequencer (PE Applied Biosystems) through the Australian Genome Research 
Facility. Oligonucleotide primers were purchased from GeneWorks (Adelaide). University 
of Wisconsin Genetics Computer Group (UWGCG) programs BLASTP, FASTA, PILEUP, 
and BESTFIT were used through WebANGIS version 2.0 for DNA and protein sequence 
1 0 analyses of the GenBank, EMBL, PIR and S WISSPROT databases using standard defaults. 

Cloning of Tn5 flanking sequences 

EcdRL-digestcd genomic DNA from X. albilineaiis Tox* mutant LSI 56 was 
ligated into pBluescript n SK and electroporated into E. coli DH5a. Transfomiants were 
selected on LB medium containing kanamycia and ampicillin, yielding clone pBEAl, from 
1 5 which subclones pCEAl and pPEAl were obtained (Figure 1). 

Amplification of sequences from wild-type LSI 55 by PCR 

Sequences flanking the Tn5 insertion in LS156 were used to design primers (AlF: 
5'-TTTGGGTTGGATCGGGTAG-3' and AIR: 5'-CCTTCTCGTCCTTG CTCTTC-3') 
for PCR-amplification of the corresponding wild type A! albilineans LSI 55 chromosomal 

20 DNA. PCR was performed in a volume of 50 |J,L with 200 ng of genomic DNA, 0.4 ng 
|aU^ of each of primer, 0.2 mM of each of dNTP, 1.8 mM Mg^^, and 1 unit of elongase 
enzyme mix (Life Technologies). A 4-min initial denaturation step at 94° C W25 followed 
by 35 thermal cycles of denaturation at 94° C for 1 min, annealing at 55° C for 1 min, and 
extension at 72° C for 2 min. The amplified DNA fragment was cloned into pGEM-T to 

25 give pGTAl (Figure 1). 

Constniction of expression vectors 

The coding region of the xabA gene was amplified from pGTAl by PCR. Primer 
AlFl (5'-GGAATICCATGCCCAATGCCGTACCG-3') contained an EcdSl restriction 
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site (underlined) for insertion of the amplified gene into the correct reading firame oflacZ 
in PLAFR3. Primer AlRl (5'-CGGGATCCCGTGCTCACCAGGCGTAGTGG-3') 
introduced a Bamm restriction site (underlined), 5 bases downstream from the stop codon 
of the amplified gene. The amplified DNA fragment was digested with EcoBl and Banim, 
and ligated with £:coRI/5a/?iHI-digested pLAFR3 to result in pl^XAB A 

Similarly, the coding region of the entD gene was PCR-amplified from E. coli 
DH5a by colony PGR usmg primers EntDF (5'- 
TCCCGGAATTCCATGGTCGATATGAAAACTACGC-3') and EntDR (5'- 
GCCCAAGCTTCTAATCGTGTTGGCACAGCGTTATG-3'). then ligated into pLAFR3 
to produce pLENTD. The inserts in pLXABA and pLENTD were sequenced to confirm 
the expected clones. 



Bacterial triparental mating 

DNA transfer between E. coli donor (JM109 pLAFR3 ± insert) and X. albilineans 
recipient (LS155 or LS156) was accomplished by triparental transconjugation v/ith helper 

15 strain pRK2013. The mid-log-phase cultures of the recipient were spotted onto agar plates 
containing YEB medium with no antibiotics (20 ^iL per spot). After the liquid was 
absorbed by the agar, 20 of mid-log-phase culture of the helper was added to each spot. 
The liquid was again allowed to absorb, and 20 ^1 of mid-log-phase culture of the donor 
was added to each spot. After incubation of the mating plates overnight at 28° C, 

20 transconjugants were selected on SP plates supplemented with tetracycline and ampicillin. 



Results 



Cloning and sequencing of the xabA eene req uired for albicidin production 

Xanthomonas albilineans Tox' mutant LSI 56 contains a single Tn5 insertion, in a 
3.0-kb EcoRI restriction fragment (Wall & Birch, 1997). Selection for Tn5-encoded 
25 kanamycin resistance, following shotgun cloning ofEcdSl restriction fragments of LS156 
DNA into pBluescript II SK, yielded pBEAl (Figure 8). 

Both strands of the insert in pBEAl excluding the Tn5 insertion were sequenced 
by primer-waUdng from T3 and T7 vector sequences in pBEAl and subclones pCEAl and 
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pPEAl. The corresponding genomic region was amplified from wild-type X. albilineans 
LS155 by PGR, and cloned into pGEM-T to give pGTAl. Sequencing of pGTAl revealed 
that a 9-bp imperfect repeat sequence (TTGGCCACG) in the genome of LSI 56 
accompanied the Tn5 insertion (following base number 1869 in Figure 9). The double- 
5 strand nucleotide sequence of the 2989 bp wild type Ecd^ fragment is deposited in 
GenBank imder accession no. AF191324. 

Reading frame analysis of the 3 kb EcdSl fragment revealed that only one ORF 
(designated xabA) is disrupted by the Tn5 insertion. This ORF encodes a protein of 278 aa 
(Mr 29 277), with 6.12% codons rarely used in E, colL There were no close matches to E. 

10 coli -10 (TATAAT) and -35 (TTGACA) consensus promoter sequences, and no 
appropriately spaced RBS sequence (such as AGGA or GAGG) in the region upstream of 
the putative start codon ATG (Figure 9). A region of GC-rich dyad symmetry with a free 
energy of -10.2 kcals/mol was found, followed by two TCTC boxes that closely resemble 
the TCTG consensus sequence characteristic of many factor-independent termination sites 

15 (Brendel & Trifonov, 1984; Piatt, 1986) downstream of the TGA termination codon of 
xabA. 

Comparison of XabA with other bacterial PPTases 

A search for proteins with homology to the deduced xabA product, using the 
FASTA and BLASTP and SWISSPROT programs, indicated regions of similarity to EntD 

20 from Escherichia coli (170 aa overlap, 35.9 % identity, 56.5 % similarity). Shigella 
flexneri (180 aa overlap, 35.0 % identity, 55.6 % similarity), Salmonella typhimuritim (184 
aa overiap, 35.9 % identity, 62.0 % similarity), and Salmonella austin (172 aa overlap, 
36.1 % identity, 61.1 % similarity). XabA contains (V/I)G(V/I)D and 
(FAV)(S/C/T)xKE(S/A)xxK domams characteristic of the phosphopantetheinyl transferase 

25 (PPTase) superfamily, and shares 17-36 % overall identity, 39-62 % overall similarity, 
with other bacterial PPTases (Table 4). 

Enhanced expression of xabA bv complementation in LSI 56 results in increased 
production of albicidins 

Mobilisation of pLAFR3 or pLXABA (pLAFR3::jcaM) by triparental matings 
30 into Tox* mutant LS156 occurred at a frequency of 1.5 x 10'^ transconjugants/recipient 
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cells. Albicidin production was undetectable in Tox' mutant LS156 and LS156 (pLAFR3) 
controls, but introduction of the xabA gene on pLXAB A enhanced albicidin production 
restored albicidin production (Figure 10). In LS156 (pLXABA), as in LS155, albicidin was 
first detectable in late-log-phase cultures (OD550 = 0.7) and was maximal in stationary 
5 phase. Albicidin production was not responsive to IPTG or glucose, and the lac promoter 
driving xabA in pLXABA is considered to express constitutively in X albilineans. The E. 
coli entD gene, expressed firom the lac promoter in pLENTD, also complemented the 
;caM::Tn5 mutation, restoring albicidm production in LS156. 

Discussion 

10' A gene requured for albicidin production in X, albilineans was isolated using a 

Tn5 mutagenesis and shotgun cloning approach. The ORF mterrupted by Tn5 in Tox' 
mutant LSI 56 is designated xabA, This ORF was isolated firom Tox"*" parent strain LSI 55, 
and shown to enhance albicidin production early in the production phase in LSI 56 when 
expressed firom the lac promoter in pLAFR3. Tn5 insertions typically cause polar 

15 mutations affecting all downstream cistrons in an operon (De Bruijn and Lupski, 1984). 
Complementation of the mutation m LSI 56 by the isolated xabA ORF indicates the 
absence of any downstream cistron iuvolved in albicidin production. There is no consensus 
RBS sequence close to the alternative start codons for this ORF in the X, albilineans 
genome. Translation may be initiated witiiout an evident ribosome binding sequence 

20 complementary to the 3' end of the 16S rRNA, as observed for some streptomycete genes 
involved in secondary metabolism (Strohl, 1992), and for some chloroplast genes (Kozak, 
1999). 

PPTases play an essential role in priming polyketide, fatty acid, non-ribosomal 
peptide and siderophore biosynthesis (Gehring et al, 1997a; Lambalot et al., 1996; 

25 Marahiel et al, 1997; Walsh et al, 1997). All polyketide synthase, fatty acid synthetases, 
and non-ribosomal peptide synthetases require post-translational modification to become 
catalytically active (Walsh et aL, 1997). The inactive apo-proteins are converted to their 
active holo-fonns by transfer of the 4'-phosphopantetheinyl (P-pant) moiety of coenzyme 
A to the sidechain hydroxyl of a serine residue in a conserved carrier domain (Lambalot et 

30 a/., 1996; Walsh et al, 1997). The P-pant moiety serves to covalently tether the growing 
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product, which is assembled by sequential action of multiple catalytic domaiis in these 
complex synthetases (Walsh et al, 1997). 

A family of more than twenty PPTases is recognised by a common 
(V/I)G(V/I)Dx40-45 . . .(FAV)(S/C/T)xKE(A/S)xxK signature sequence, but overall 
5 sequence homologies are low (Gehring et al, 1997; Lambalot et al, 1996; Nakano et a/., 
1992; Quadri et al, 1998a). In E. coli, there are two PPTases with distinct specificities: 
ACPS is active on acyl carrier protein (ACP) domains in fatty acid and polyketide 
synthase; EntD is active on peptidyl carrier protein (PGP) and aryl carrier protein (ArCP) 
domains in peptide synthetases (Lambalot et aL, 1996; Walsh et al, 1997). Thus, PPTases 

10 may be partner-protein specific. However, firom B. subtilis appears to be non-specific, 
efficientiy activating both fatty acid, polyketide synthase and peptide synthetases (Kealey 
et aL, 1998; Mofid et aL, 1999; Quadri et aL, 1998a). XabA includes the PPTase VGID 
and FSxKESxxK motifs. Although it has highest overall similarity to the peptide-selective 
EntD proteins, the sequence groupings are not sufficiently compelling to predict the 

1 5 specificity of XabA for polyketide synthase or peptide synthetases (Table 4, Figure 1 1). 

Complementation studies have revealed substantial functional interchangeability 
of PPTases in different bacteria. For example, the A sublitis sjp gene involved in surfactin 
biosynthesis complements mutants in E, coli entD (enterobactin biosynthesis) ai.d^. brevis 
gsp (gramicidin biosynthesis) (Borchert et aL, 1994; Grossman et aL, 1993). In vitro, 

20 ACPS fi*om E, coli activates apoproteins fi-om Lactobacillus, Rhizobium and Streptomyces 
(Lambalot et aL, 1996). Because XabA shows highest similarity to EntD, we amplified the 
entD-co6mg region firom E, coli, and arranged it for expression firom the lac promoter in 
broad host-range vector pLAFR3. This construct (pLENTD) restored albicidin production 
in X. albilineans xabAwTnS mutant LS156. EntD is a peptide-selective PPTase that 

25 converts inactive apo-EntF and apo-EntB to active holo-enzymes involved in biosynthesis 
of enterobactin in E, coli (Gehring et aL, 1997a). Functional complementadon of the 
xabAvJnS mutation by entD indicates tiiat XabA is a PPTase required for post- 
translational activation of synthetases involved in albicidin production in Z albilineans. 
The specificity of EntD for activation of peptide synthetases in E. coli indicates that 

30 albicidin biosynthesis probably involves an XabA-activated peptide synthetase. 



wo 02/24736 



PCT/AUOl/01190 



-86- 

Some PPTase genes involved in non-ribosomally synthesised peptide biogenesis 
are located near the genes encoding their targets (Quadri et al, 1998b). For example, B. 
brevis gsp, B. sublitis sjp, and E, coli entD genes all lie within 4 kb of operons encoding 
the target peptide synthetases (Borchert et al, 1994; Coderre & Earhart, 1989; Nakano et 
5 al, 1992). However, M tuberculosis pptTis not located near the mbt gene cluster encoding 
the target peptide synthetases involved in mycobactin biosynthesis (Quadri et al, 1998b). 
No gene encoding a PPTase has been identified in any of the antibiotic and phytotoxin 
biosynthetic gene clusters characterised firom Streptomyces spp. (Gehring et al, 1997b) 
and Pseudomonas spp. (Bender et aL, 1999). No evident target gene was found within 
10 1282 bp upstream or 870 bp downstream of xabA, Three cosmids spanning about 100 kb in 
two regions of the genome complemented 56 of 58 tested Tox" mutants ofX. albilineans, 
but not LSI 56 (Rott et aL, 1996). These results indicate ihdixabA is not clustered with the 
genes encoding the antibiotic synthetases that it activates. 

Expression of xabA (or an alternative PPTase such as entD) is essential for 
1 5 albicidin biosynthesis. The phosphopantetheinyl transferase gene described herein provides 
new insight into antibiotic biosynthesis in the Pseudomonadaceae, and new opportunities 
to understand and apply albicidins as potent inhibitors of prokaryote DNA replication. This 
gene, together with the xabB provide the means to engineer high level co-expression of the 
albicidin synthetase and its activating PPTase to obtain higher yields of albicidins, and 
20 ultimately to manipulate the elements of this biosynthetic machinery, by mutagenesis or 
otherwise, to produce desired structural variants of this novel antibiotic class. They may 
also indicate a new approach to disease resistance, by engineering plants to interfere with 
the biosynthesis of albicidin toxins, which are key pathogenesis factors for the systemic 
development of leaf scald disease. 

25 EXAMPLES 

A methvltransferase sene is involved in albicidin biosynthesis in Xanthomonas a lbilineans 
Material and Methods 

Bacterial strains and plasmids 

The properties of bacteria and plasmids used in this example are listed in Table 5. 
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Media, culture conditions and antibiotics 

X. albilineans strains were routinely cultured on sucrose peptone (SP) medium at 
28° C (Birch and Patil, 1985b). Escherichia coli strains were used as hosts in cloning 
experiments and were grown on LB medium at 37" C (Sambrook et al., 1989). Broth 
5 cultures were aerated by shaking at 200 rpm on an orbital shaker. Modified YEB medium 
(Van Larebere et al, 1977) was used for patch mating. The following antibiotics were 
added to media as required: kanamycin, 50 Hg/mL; tetracycline, 15 ng/mL; ampicillin, 100 
Hg/mL. 

Assay of albicidin production 

Albicidin was quantified by a microbial plate bioassay as described previously 
(Birch and Patil, 1985b), except that the 10 mL basal layer of LB agar and the 5 mL 
overlayer of 50% LB with 1% agar were supplemented with tetracycline, and E. coli 
DH5a [pLAFRS] was used as the indicator strain. This change avoided interference by 
tetracycUne, which was added to some cultures to ensure retention of pLAFR3 derivatives 
mX. albilineans. 

Routine genetic procedures 

Bacterial genomic DNA and plasmid DNA isolation, gel electrophoresis, DNA 
restriction digests, ligation reactions and transformation were performed by routine 
procedures (Sambrook et al, 1989). DNA firagments were excised fi-om agarose gels and 
20 residual agarose was removed with the BRESAclean™ DNA purification kit (GeneWorks, 
Adelaide). 

DNA sequencing and analysis 

Sequencing reactions were performed by dideoxynucleotide chain termination 
(Sanger et al, 1977) using the BigDye™ Terminator Cycle Sequencing Kit and 373A 
25 DNA sequencer (PE Applied Biosystems) through the Australian Genome Research 
Facihty. Oligonucleotide primers were purchased fiiom GeneWorks (Adelaide). University 
of Wisconsin Genetics Computer Group (UWGCG) programs BLASTP, FASTA, PILEUP, 
and BESTFIT were used through WebANGIS version 2.0 for DNA and protein sequence 
analyses of the GenBank, EMBL, PIR and SWISSPROT databases. 



10 



15 
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Recovery of the downstream sequence of truncated xabC by IPCR 

Genomic DNA of X albilineans LSI 55 was digested with Ncol, Following 
phenol/chloroform extraction and ethanol precipitation, the digested DNA was self-Ugated 
at a concentration of 0.5 ng/mL. The ligated DNA was precipitated with ethanol and 
5 resuspended in sterile H2O to a concentration of 20 ng/|xL as template for IPCR. Sequence 
. of the 16.5 kb EcoRI fragment including the 5* region ofxabC was used to design primers 
(IF: 5 ' -AAGCGTCGAC ATAGC AGCAG-3 ' and IR: 5 ' - 

CGGCAACGCATTCGACCTCG-3') for IPCR-amplification of the sequence downstream 
of tlie EcdSI site of truncated xabC gene. 

10 IPCR was performed in a volume of 50 jiL with 50 ng of template DNA, 0.4 

ng/|iL of each of primer, 0.2 mM of each of dNTP, 1.8 mM Mg^^ and 1 unit of elongase 
enzyme mix with proof-reading activity (Life Technologies). A 10 min initial denaturation 
step at 94° C was followed by 35 thermal cycles of denaturation at 94° C for 1 min, 
annealing at 55° C for 1 min, and extension at 72° C for 1 min per 1 kb of expected 

15 ampUfication product. The IPCR product was cloned into pZErO-2 to give pZIXC, Clones 
of construct pZIXC from three independent PGR reactions were sequenced to rule out the 
possibility of PCR-generated errors. 

Insertional mutagenesis 

An internal 625 bp ClaVEcdBl fragment oixabC (Figure 13) was firstly cloned 
20 into C/aI/£:caRI-digested pBluescript II SK to provide a Kpnl restriction site, then 
subcloned into ^c^7RI/A:/?wI-cleaved pJP5603 to yield pJP-BEC. The inserts in pBluescript 
II SK intermediates (pBEC) were sequenced to confirm the expected clones. 

The suicide construct pJP-BEC was transferred from the mobilising strain E. coli 
S17-1 (Xpir) into X. albilineans LS155. Exconjugant colonies were selected on SP agar 

25 containing kanamycin and ampicillin. Insertional disruption in xabC or thp was verified by 
PGR using primers flankmg the expected integration site of pJP-BEC or pJP-BAS and 
extension at 72° C for 1 min as previously described (Zhang and Birch, 1997b). The effect 
on albicidin biosynthesis was detemained using the microbial plate assay. Representative 
(Tox") insertional mutants in xabC (LS-JPl) and thp (LS-JP2) were retained for further 

30 analysis. 
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Construction of expression vectors 

The coding region of the xabC gene was amplified firom X. albilineans LSI 55 
chromosomal DNA by PGR. Primer A3F (5^-CGGGATCCCATGGATTCAGCGTTACC- 
30 contained a BamHI restriction site (underlined) for insertion of the amplified gene into 
5 the correct reading firame of lacZ in pLAFR3. Primer A3R (5'-CCC AAGCTT TCATTAT 
GGGGCCCTCTTGC-30 introduced a HindSR restriction site (underlined). The amplified 
DNA was digested with BamSL and Hin6Sl, and Ugated with 5a;MHI/7ymdni-digested 
pLAFRS to result in pLXABC. X, albilineans Tox" mutant LS157 contains a single Tn5 
insertion, in a 4.1 kb Clal restriction firagment or a 16.5 kb EcoRI restriction firagment 

10 (Figure 12). Selection for kanamycin resistance, following shotgun cloning of Clal 
- -restriction firagments of LSI 57 DNA into pBluescript H SK, yielded clone pBC157. 
Sequences Ranking the Tn5 insertion in LSI 57 DNA were amplified by inverse PGR, and 
cloned into pZErO-2, producing pZIL and pZIR. The double-strand sequence of the 16,51 1 
bp EcoRI genomic firagment in pLXABB was obtained by a primer-waDdng approach, 

15 using subclones pBC157, pZIL, pZIR, pSEBL, and pSEBR. The Tn5 insertion in the 
genome of LS157 is accompanied by 9-bp perfect repeat sequence (GTCCTGAAG), 
commencing at 2490 bp m GenBank accession no. AF239749. 

Genetic complementation of albicidin biosynthesis 

DNA transfer between E. coli donor (JM109 pLAFR3 ± insert) and X. albilineans 
20 recipient (LS-JPl or LS-JP2), was accomplished by triparental transconjugation with 

helper strain pRK2013. Mid-log-phase cultures of the recipient were spotted onto agar 

plates containing YEB medium with no antibiotics (20 )iL per spot). After the liquid was 

absorbed by the agar, 20 [aL of mid-log-phase culture of the helper was added to each spot. 

The liquid was again allowed to absorb, and 20 jxL of mid-log-phase culture of the donor 
25 was added to each spot. After incubation of the mating plates overnight at 28*" C, 

transconjugants were selected on SP plates supplemented with ampicillin, and tetracycline 

or spectinomycin. 

Transconjugants were tested for albicidin production using the microbial plate 
bioassay. The constructs pLXABB, pLXABC were designed to test complementation in 
30 trans. However, complementation could also occur in cis, by homologous recombination 
between the complementing construct and the insertionally mutated chromosomal gene. To 
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exclude this possibility, the retention of the insertion in xabC was confirmed by PGR, 
using primers firom apM (in the insertion) and xabB (adjoining xabC in the chromosome). 

Results and Discussion 

ninnin g and sequencing of the full-length j:a6C gene 
5 Downstream by 45 bp from the TAG stop codon of xab£ is the start of an ORF 

(designated xabQ in the same orientation. The 639-bp sequence downstream of the EcoKL 
site of the truncated xabC was amplified from wt X albilineans LSI 55 using IPCR. The 
double-strand nucleotide sequence of 1515 bp from the stop codon of xabB to the Ncol site 
downstream of xabC (Figure 13) is deposited in GenBank under accession no. AF239750. 

10 The xabC ORF encodes a protein of 343 aa (Mr 37,704). One TCTG-hke sequence 
(TGTG) and one typical TCTG box characteristic of many factor independent termination 
sites (Brendel and Trifonov, 1984) occur downstream of the termination codon (TAA) of 
xabC (Fig. 2). However, the other features typical of such terminators (a region of GC rich 
dyad symmetry, followed by a run of consecutive thymine residues) are not present within 

15 435 bp downstream of the xabC stop codon. 

XabC is similar to Q-methvltransferases 

The deduced product of xabC shows 22-30% overall identity and 52-60% overall 
similarity to a family of methyltransferases that utilise S-adenosyl-methionine (SAM) as a 
co-substrate for 0-methylation of small molecules (Ingrosso et al, 1989; Haydock et al, 

20 1991; Kagan and Clarke, 1994). These enzymes include tetracenomycin polyket^.de C-8 0- 
methyltransferase (TcmO, P39896) and C-3 0-methyltransferase (TcmN, P16559) of 
Streptomyces glaucescens, hydroxyneurosporene-O-methyltransferase (PI 7061) of 
Rhodobacterium capsulatus, and hydroxyindole-O-methyltransferases of rat pineal and 
retina (009179) and chicken pineal gland (Q92056). Three highly conserved motifs in 

25 SAM-dependent methyltransferases are also present in XabC as shown in Figures 13 and 
14. The crystal structure analysis for the methyltransferase-SAM complex (Schlukebier et 
aL, 1995) provides firm structural evidence for the role of motif I in SAM binding. 
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Tn^r^rtinnal miita penesis of rfl/>C blocks alhiddin biosvnthesis 

Insertional mutation in «z6C was accomplished using suicide-vector pJP-BEC and 
confinned by PGR. Six out of eight tested transconjugants were verified by PGR to contain 
insertional mutations in xabC. Albicidin production was undetectable in these insertional 
mutants, compared to wt X albilineans LS155 control. The other transconjugants may 
result from integration of the vector at other genomic locations by illegitimate 
recombinations as reported previously (Penfold andPemberton, 1992). 



Complementation test ^ 

Introduction of the xabC gene in pLXABC or the truncated xabC gene in 

10 pLXABB into insertional mutant LS-JP2 restored albicidin production to the level of the 
wt parental strain LS155. This indicates tiiat xabC is essential for albicidin production in X. 
albilineans. The truncated xabC in pLXABB (SEQ ID NO: 106) encodes 277 residues 
(SEQ ID NO: 107), including all of the tiiree conserved motifs of SAM-methyltransferases, 
and appears fully functional by complementation. The continued presence of an insertion 

15 in the chromosomal locus was confirmed by PGR. Thus, complementation was operating 
in trans. This also indicates that no other cishron downstream of xabC is required for 
albicidin production, because insertional mutagenesis typically causes polar mutations 
affecting all downstream cistrons in an operon CDe Bruijn and Lupski, 1989). 

Pr.hanr.Rd exnressio n nfxahC resu lts in increased production of albicidins 
20 Derivatives o£X. albilineans shrain LS155, in which an xabC gene, or fragment 

thereof, was introduced in trans, were tested for production of albicidin using the bioassay 
described above. The results, presented in Figure 15, show that expression of xabC cloned 
into pLAFR3 in derivatives of X. albilineans stirain LS155 complements an insertional 
mutation in the chromosomal xabC, and also enhances albicidin production eirly in the 
25 production phase. Expression of the first part of the xabB operon, including the fiiU-length 
xabB and a truncated but fimctional xabC, fiirther enhances albicidin production. 
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The disclosure of every patent, patent application, and publication cited herein is 
hereby incorporated herein by reference in its entirety. 

The citation of any reference herein should not be construed as an admission that 
such reference is available as 'Trior Art" to the instant application 

5 Throughout the specification the aim has been to describe the preferred 

embodiments of the invention without limiting the invention to any one embodiment or 
specific collection of features. Those of skill in the art will therefore appreciate that, in 
light of the instant disclosure, various modifications and changes can be made in the 
particular embodiments exemplified without departing firom the scope of the present 
10 invention. All such modifications and changes are intended to be included within the scope 
of the appended claims. 
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TABLES 
TABLE 1 



Bacterial strains, and plasmids for Example 1 



■Strain or 
plasmids 


* Relevant characteristics • 


" Reference or 
source 


Strains 






E. coli 






DH5a 


O80dlacZAM15. A(lacZYA-argF 


Promega 


JM109 


[F', lacI'ZAM15], A(lac-proAB 


Promega 


TOPIO 


F, A(iiiiT-h3dRMS-mcrBQ, A(are-leu)7697, AlacX74 


Invitrogen 


X, albilineans 






Xal3 


Wild-type albicidin producer from sugarcane (Queensland), Ap' 


Inventor's 
laboratory 


LSI 55 


Wild-type albicidin producer from sugarcane (Queensland), Ap' 


Wall and Birch 


LS157 


LS155::Tn5, albicidin deficient (Tox'), Km' Sf Ap' 


Wall and Birch 
(1997) 


Plasmids 






pBluescript 

nsK 


ColEl origin, E. coli cloning vector, Ap 


Strata (TRTiR 


pZEiO-2 


ColEl origin, E. coli cloning vector, JCm^ 


Invitrogen 


pRK2013 


ColEl origin, IncP, Tra^ helper plasmid, Km^ 


Ditta et al 
(^980) 


pLAFR3 


RK2 origin, Tra", Mob*, broad host-range cosmid, Tc*^ 


Stachclhaus 
atal. (1987) 


pRG960sd 


ColEl origin, broad host-range plasmid, contains promotcrless uidA with 
start codon and Shine-Dalgamo sequence, Sm'Sp^ 


Van den.Edde 
eM/. (1992) 


pBC157 


9.9-kb Clal fragment carrying Tn5 and flanking sequences from LS157, 
in pBluescript 11 SK. Km' Ap' 


This study 


pZIL 


1 .4-kb fragment, inverse PGR amplified from LSI 57 in pZErO-2, Km' 


This study 


pZIR 


6.0-kb fragment, inverse PGR amplified from LSI 57 in pZErO-2, Km' 


This study 


pZTI 


0.9-kb fragment, PGR amplified from LS157 in pZErO-2, Km' 


This study 


pXABB 


1 6,5-kb EcoRI fragment from Xal 3 in pBluescript U SK, Ap' 


Tjis study 


pSEBL 


7.9-kb EcoRI-Spel frament frompXABB in pBluescript n SK, Ap' 


This study 


pSEBR 


8.6-kb EcoRI-SpcI frament frompXABB in pBluescript II SK, Ap' 


This study 
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. StJ-ain or 
plasmids 


' ' Relevant characteristics 


^ Reference or 
source 


pLXABBl 


16.5-kb EcoRI fragmeat from pXABB mpLAFR3 (xabB in the same 
direction as lac), Tc^ 


This study 


pLXABB2 


16.5-kb EcoRI fragment from pXABB in pLAFR3 (xabB in the opposite 
direction to lac), Tc^ 


Ihis stady 


pRG960pl 


206-bp BamHI-Xmal frament inpRG960sd, Sm'Sp' 


This study 


pRG960p2 


690-bp Bamm-Xmal frament in pRG960sd, Sm'Sp' 


This study 
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TABLE2 



Comparison of conserved sequences in veptide synthetases andXabB 



Domain 


. Co?/ e 


Sequence conserved in 
peptide synthetasef • 


Seuuenc^ in JCabS 


Position in 
. Xab (aa) 


Adenylation 


Al 


L(T/S)YXEL 


WSYAQL 


3806-3811 




A2 


LKAGxAYL (V/L) P (Ii/I) D 


FKAGACYVPID 


3851-3861 




A3 


LAYxxYTSG (S/T) TGxPKG 


LACVMVTSGSTGRPKG 


3917-3932 




A4 


FDxS 


FAVS 


3967-3970 




A5 


NxYGPTE 


NNYGCTE 


4063-4069 




A6 


GELxIxGxG (V/L) ARGYL 


GELHVHSVGMARGYW 


4114-4128 




A7 


Y(R/K) TGDL 


YKTGDM 


4152-4157 




A8 


GRxDxQVKIRGxRIELGEIE 


GRQDFEVKVRGHRVDTRQ 
VE 


4170-4189 




A9 


LPxYM{I/V)P 


LPTYMLP 


4239-4245 




AlO 


NGK(V/L)DR 


NGKLDR 


4259-4264 












Ppnridvl carrier 
protein 


PCP 


DxFFxLGG (H/D) S (L/I) 


DNFFALGGHSL 
MDFFAVGGHSV 


4306-4316 
3261-3271 












Condensation 


CI 


SxAQxR(L/M) (W/Y)xL 


TYAQERLWLV 


3333-3342 




C2 


RHExLRTxF 


RHEVLRTRF 
RHEILRTRF 


3381-3389 
4421-4429 




C3 


MHHxISDG(W/V)S 


IHHIISDGWS 
MHHLIYDAWS. 


3456-3465 
4498-4507 




C4 


YXD(F/Y)AVW 


YADYALW 
YADYAIW 


3495-3501 
4538-4544 




C5 


{I/V)GXFVNT(Q/L) (C/A)xR 


IGFFINILPLR 
IGFFINILPLR 


3606-3617 
4649-4659 




C6 


(H/N) QD (Y/V) PFE 


HQSVPFE 
NQALPFE 


3641-3647 
4685-4691 




C7 


RDxSRNPL 


RDSSQIPL 
RDTSRIPL 


3658-3665 
4701-4708 



'^Sourced from reference (Marahiel et aU 1997). 
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TABLE3 



Bacterial strains, and plasmids for Example 2 



Strain or 
plasmids 


Relevant characteristics 


Reference 
or source 


Strains 






E. coli 






DH5a 


<D80dlacZAM15, recAl, eadAl, gyrA96, thi-1, lisdR17(rk*, supE44, 
relAl, deoR, A(lacZYA-argP)U169 


Promega 


JMI09 


[F, traD36, proAB, lacI^ZAMlS], recAl, endAl, gyrA96, thi bsdR17(rk-, 
mt^, supE44. relAl, A(Iac-proAB) 


Promega 


X, albilineans 






Xal3 


Wild-type albicidin producer ftom sugarcane (Queensland), Ap' 


This 

Ic boratory 


LS155 


Wild-type albicidin producer fiom sugarcane (Queensland), Ap^ 


Wall& 
Birch (1997) 


LS156 


LS155::Tn5, albicidin deficient (Tox"), Km' St" Ap' 


Wall& 
Birch (1997) 


Plasmids 






pBluescript n 
SK 


ColEl origin, E. coli cloning vector, Ap' 


Stratagene 


pGEM-T 


ColEl origin, E. coli TA-cIoning vector, Ap' 


Promega 


pRK2013 


ColElorigin, IncP, Tra"^, helper plasmid, Km' 


Ditta et al. 
(1980) 


pLAFR3 


RK2 origin, Tra", Mob"^, broad host-range cosmid. To' 


Staskawicz 
et aL(1987) 


pBEAl 


8.8-kb EcoRI fragment carrying Tn5 and flanking sequences from 
LSI 56, in pBluescript II SK, Km' Ap' 


This study 


pCEAl 


1766-bp fcoRI-C/al fragment frompBEAl in pBluescript 11 SK, Ap' 


This study 


pPEAl 


697-bp Ecd^-Pstl fragment from pBEAl in pBluescript n SK, Ap' 


This study 


pGTAl 


2.1-kb fragment, PGR amplified from LS155 in pGEM-T, Ap' 


This study 


pLXABA 


834-bp EcdM-BamlH fragment (xabA ORF) from pGTAl in pLAFR3, Tc' 


This study 


pLENTD 


630-bp EcoRl-Hin^ni fragment (entD ORF) from DH5D in pLAFR3, Tc' 


ITiis study 
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T ABLE 4 

Similarity between XabA and other PPTases involved in antibiotic and f atty acid 
biosynthesis in bacteria 



Pathway - 


Protein 

^ ■ ■.. 


Organism • 


Specificity 
(A/P)t. 


. Domain i . . 




Domain D . . 


Homology 
' ([D/SIM) 


Albicidin 


XabA 


X . albilineans 


? 


GVGIDLERP-- 


(X)39- 


-fsakeslfka;y 




- 


Eaterobactin 


EntD 


E . coll 




EIGIDIESI-- 


(X)36- 


-FSAKESAFKASE 


35 


9/56.5 






S . f lexneri 


7 


PIGVD1EEI-- 


{X)36- 


-FSAKESAFKAS*? 


35 


0/55.6 






S . typhimurlum 




RIGIDIBKI-- 


(X) 35- 


- F SAKEovTKAr Q 


35 


9/62.0 






S . austin 


7 


RVGVDlEia-- 


(X)3S- 


-FSAKESVYKAlfQ 


36 


1/61.1 


Mycobactin 


PptT 


M . tuberculosis 


P 


SVGIDAEPH- - 


tX)34- 


-FCAKEATYKAWP 


30 


5/55.5 


Surfactin 


Sfp 


B . Biibtilis 


A/Pt 


EIGIDIEKT-- 


(X)3S- 


-WSMKESFIKQE3 


24 


8/48.5 




Pof-1 


B . pumilus 


7 


FVGID1EEI-- 


(X)35- 


-WSMKEAFIKLTG 


19 


B/47.6 


Gramicidin 


Gap 


B.brevis 


Pf 


PVGIDIERI-- 


{X)35- 


-WTIKESYIKAIG 


20 


8/42.0 


Iturin A 


Lpa-14 


B.subtilis 


7 


PIGID1EKM-- 


(X)35- 


-WSMKESFIKQAG 


20 


0/43.4 


Fatty acids 


HI0152 


H . influenzae 


7 


AVGIDIEFP-- 


(X)34- 


-WCLREAVIiKSQG 


19 


7/45.7 




AcpS 


B . coli 




GLGTDIVEI-- 


(X)40- 


-FAVKEAAAKAFG 


16 


5/38.8 






M . tuberculosis 


A 


GVGIDLVSI-- 


(X)41- 


-WAAKEAVIKAWS 


25 


7/47.6 






B. subtilis 


? 


GIGLDITEL-- 


{X)41- 


-FAAKEAFSKAFQ 


25 


5/46.2 


PPTase domain 








(V/I)G(I/V)D (F/W) (S/C/T)XKE(S/A)XXK 
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TABLES 



Bacterial strains, and plasmids for Example 3 



Strain or 
plasmids . 


Characteristics . 


Reference or 
source 


Strain 






E. coll 






DH5a 


OSOdlacZAMIS, A(lacZYA-argF)U169 


Promega 


JM109 


[FS lacI'^ZAMlS], A(lac-proAB) 


Promega 


TOP 10 


F, A(mrr-hsdRMS-mcrBC), A(are-leu)7697, AlacX74 


Invitrogen 


S17-apir 


SI 7-1 lysogenized with Xpir 


Penfold and 
Pemberton(1992) 


X albilineans 






Xal3 


wt albicidin producer from sugarcane (Queensland), Ap^ 


Our laboratory 


LS155 


wt albicidin producer from sugarcane (Queensland), Ap^ 


Wall and Birch 
(1997) 


LS157 


xabB::Tn5, albicidin deficient (Tox*), Km' St^ Ap' 


Wall and Birch 
(1997) 


LS-JPl 


thp::pJP-BAS, albicidin deficient (Tox"), Km^Ap' 


This work 


LS-JP2 


xabC::pJP-BEC, albicidin deficient (Tox"), Km'Ap' 


This work 


Plasmids 






pBluescript EE 
SK 


ColEl origin, E. coli cloning vector, Ap' 


Stratagene 


pZErO-2 


ColEl origin, E. coli cloning vector, Km' 


Invitrogene 


pRK2013 


ColBlorigin, IncP, Tra"*", helper plasmid, Km' 


Dittaera/. (1980) 


pLAFR3 


RK2 origin, Tra', Mob^" broad host-range cosmid, Tc' 


Staskawicz et al. 
(1987) 


pJP5603 


Bacterial suicide vector, Km' 


Penfold and 
Peniberton(1991) 


pZKC 


1 kb IPCR product in pZErO-2, Km' 


This work 


pBAS 


278 bp Apal-Sall fragment of thp inpBluescrpt n SK, Ap' 


This work 
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Strain or 
plasmids 


. ; Characteristics 


Reference or 
source 


pJP-BAS 


284 bp Sau-Kpnl iragnieiii uompoiio in pjrjouo, r^ju 


This work 


pBEC 


625 bp Qal-EcoRI fragment of xabC in pBluescript n SK, Ap' 


This work 




655 bp EcoRI-Kpnl fragment frompBEC inpJP5603, Km*^ 


This work 


pLTHP 


1226 bp EcoRI-BamHI fragment from pLXABB in pLAFRB, 
Tc' 


This work 


pLXABC 


1029 bp xabC ORF an^lified from LS155 inpLAFR3, Tc' 


This work 


pLXABB 


16.5 kb EcoRI fragment from XalB in pLAFRS, Tc' 


This work 
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CLAEMS 

1. An isolated polypeptide comprising at least one domain selected from the group 
consisting of: 

(a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or 
5 more of SEQ ID NO: 6 and 8, or variants thereof. 

(b) a jS-ketoacyl synthase (KS) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 10, 12, 14, 16, 18 aad 20, or variants thereof; 

(c) a i3-ketoacyl reductase (KR) domain comprising the sequence set forth SEQ ID 
NO: 22, or variants thereof; 

10 (d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one 

or more of SEQ ID NO: 24, 26 and 28, or variants thereof; 

(e) an adenylation (A) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof. 

(f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any 
15 one or more of SEQ ID NO: 50 and 52, and variants thereof; and 

(g) a condensation (C) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants 
thereof 

2. The polypeptide of claim 1, wherein the AL domain comprises each of the sequences 
20 set forth in SEQ ID NO: 6 and 8, or variants thereof 

3. The polypeptide of claim 1, wherein the KS domain comprises each of the sequences 
set forth in SEQ ID NO: 10, 12 and 14, or variants thereof. 

4. The polypeptide of claim 1, wherein the KS domain comprises each of the sequences 
set forth in SEQ ED NO: 16, 18 and 20, or variants thereof 

25 5. The polypeptide of claim 1, wherein the A domain comprises each of the sequences set 
forth in SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof. 

6. The polypeptide of claim 1 , wherein the C domain comprises each of the sequences set 
forth in SEQ ID NO: 54, 56, 58, 60, 62, 64 and 66, or variants thereof 

7. The polypeptide of claim 1, wherein the C domain comprises each of the sequences set 
30 forth in SEQ ID NO: 68, 70, 72, 74, 76, 78 and 80, or variants thereof 
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8. The polypeptide of claim 1, wherein the domains are arranged in an N- to C-tenoinal 
direction as follows: AL-ACP-KS-KR-ACP-ACP-KS-PCP-C-A-PCP-C. 

9. The polypeptide of claim 1, comprising the sequence set forth in SEQ ID NO: 2, or 
biologically active fragment thereof, or variant or derivative of these. 

5 10. The polypeptide of claim 9, wherein the variant has at least 60% sequence identity to 
the sequence set forth in SEQ ID NO: 2. 

11. The polypeptide of claim 9, wherein the biologically active fragment is at least 6 amino 
acids in length. 

12. An isolated polypeptide comprising at least a biologically active fragment of the 
10 sequence set forth in SEQ ID NO: 2 or variant or derivative thereof. 

13. The polypeptide of claim 12, wherein the biologically active fragment is at least 6 
amino acids in length, 

14. The polypeptide of claim 12, wherein the biologically active fragment comprises at 
least one domain selected from the group consisting of: 

15 (a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or 

more of SEQ ID NO: 6 and 8, or variants thereof 

(b) a )3-ketoacyl synthase (KS) domain comprising a sequence set forih in any one or 
more of SEQ ID NO: 10, 12, 14, 16, 18 and 20, or variants thereof; 

(c) a jS-ketoacyl reductase (KR) domain comprising the sequence set forth SEQ ID 
20 NO; 22, or variants thereof; 

(d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one 
or more of SEQ ID NO: 24, 26 and 28, or variants thereof; 

(e) an adenylation (A) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof. 

25 (f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any 

one or more of SEQ ID NO: 50 and 52, and variants thereof; and 

(g) a condensation (C) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants 
thereof. 
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15. The polypeptide of claim 13, wherein the AL domain comprises each of the sequences 
set forth in SEQ ID NO: 6 and 8, or variants thereof. 

16. The polypeptide of claim 13, wherein the KS domain comprises each of the sequences 
set forth in SEQ ID NO: 10, 12 and 14, or variants thereof. 

5 17. The polypeptide of claim 13, wherein the KS domain comprises each of the sequences 
set forth in SEQ ID NO: 16, 18 and 20, or variants thereof. 

18. The polypeptide of claim 13, wherein the A domain comprises each of the sequences 
set forth in SEQ ID NO: 30, 32. 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof. 

19 . The polypeptide of claim 13, wherein the C domain comprises each of the sequences 
10 set forth in SEQ ID NO: 54, 56, 58, 60, 62, 64 and 66, or variants thereof. 

20. The polypeptide of claun 13, wherein the C domain comprises each of the sequences 
set forth in SEQ ID NO: 68, 70, 72, 74, 76, 78 and 80, or variants thereof. 

21. The polypeptide of claim 12, wherein the variant has at least 60% sequence identity to 
said at least a biologically active fiagment. 

15 22. The polypeptide of claim 12, wherein the variant has at least 70% sequence identity to 
any one of the amino acid sequences set forth in SEQ ID NO: 6, 8, 10, 12, 14, 16, 18, 20, 

22. 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 
70, 72,74,76, 78 or 80. 

23. An isolated polypeptide comprising at least biologically active fragment of the 
20 sequence set forth in SEQ ID NO: 83, or a variant or derivative thereof. 

24. The polypeptide of claim 23, wherein the biologically active fragment comprises at 
least one of the consensus PPTase sequence motifs set forth in SEQ ID NO: 89 or 93, or 
variant thereof. 

25. The polypeptide of claim 24, wherein the biologically active fragment comprises both 
25 the consensus PPTase sequence motifs set forth m SEQ ED NO: 89 or 93, or variant 

thereof. 
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26. The polypeptide of claim 23, wherein the biologically active fragment comprises the 
intervening sequence between said consensus PPTase sequence motifs, which intervening 
sequence comprises the sequence set forth in SEQ ID NO: 91, or variant thereof 

27. The polypeptide of claim 23, wherein the biologically active fragment comprises a 
5 contiguous sequence of amino acids contained within the sequence set forth in SEQ DD 

NO: 87, or variant thereof 

28. The polypeptide of claim 23, wherein the biologically active fragment is at least 6 
amino acids in length. 

29. The polypeptide of claim 23, wherein the variant has at least 60% sequence identity to 
10 the sequence set forth in SEQ ID NO: 83. 

30. The polypeptide of claim 23, wherein the variant has at least 70% sequence identity to 
any one of the amino acid sequences set forth in SEQ ID NO: 87, 89, 91 or 93. 

31. An isolated polypeptide comprising at least biologically active fragment of the 
sequence set forth in SEQ ID NO: 95, or a variant or derivative thereof 

15 32. The polypeptide of claim 31, wherein the biologically active fragment comprises at 
least one of the consensus methyltransferase sequence motifs set forth in SEQ ID NO: 99, 
101 or 103, or variant thereof 

33. The polypeptide of claim 31, wherein the biologically active fragment comprises all the 
consensus methyltransferase sequence motifs set forth in SEQ ID NO: 99, 101 and 103, or 

20 variant thereof 

34. The polypeptide of claim 31, wherem the biologically active fragment comprises a 
contiguous sequence of amino acids contained within the sequence set forth in SEQ ID 
NO: 105, or variant thereof 

35. The polypeptide of claim 31, wherein the biologically active fragment comprises a 
25 contiguous sequence of amino acids contained within the sequence set forth in SEQ ID 

NO: 107, or variant thereof 



36. The polypeptide of claim 31, wherein the biologically active fragment is at least 6 
amino acids in length. 
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37. The polypeptide of claim 31, wherein the variant has at least 60% sequence identity to 
the sequence set forth in SEQ ID NO: 95. 

38. The polypeptide of claim 31, wherein the variant has at least 70% sequence identity to 
any one of the amino acid sequences set forth in SEQ TD NO: 99, 101 or 103. 

5 39. An isolated polynucleotide comprising a sequence encoding at least one domain 
selected from the group consisting of: 

(a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 6 and 8, or variants thereof 

(b) a jS-ketoacyl synthase (KS) domain comprising a sequence set forth in any one or 
10 more of SEQ ID NO: 10, 12, 14, 16, 18 and 20, or variants thereof; 

(c) a /S-ketoacyl reductase (BCR) domain comprising the sequence set forth SEQ ID 
NO: 22, or variants thereof; 

(d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one 
or more of SEQ ID NO: 24, 26 and 28, or variants thereof; 

15 (e) an adenylation (A) domain comprising a sequence set forth in any one or more of 

SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof 

(f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any 
one or more of SEQ ID NO: 50 and 52, and variants thereof; and 

(g) a condensation (C) domain comprising a sequence set forth in any one or more of 
20 SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants 

thereof 

40. The polynucleotide of claim 39, wherein the AL domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 5 or 7, or variants thereof 

41. The polynucleotide of claim 40, wherein the AL domain is encoded by a nucleotide 
25 sequence comprising each of the sequences set forth in SEQ ID NO: 5 and 7, or variants 

thereof 

42. The polynucleotide of claim 39, wherem the KS domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 9, 1 1, 13, 15, 17 and 19, or variants 
thereof 
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43. The polynucleotide of claim 42. wherein the KS domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 9, 11 and 13, or 
variants thereof. 

44. The polynucleotide of claim 42, wherein the KS domain is encoded by a nucleotide 
5 sequence comprising each of the sequences set forth in SEQ ID NO: 15. 17 and 19, or 

variants thereof. 

45. The polynucleotide of claim 39, wherein the KR domain is encoded by a nucleotide 
sequence set forth in SEQ ID NO: 21, or variant thereof 

46. The polynucleotide of claim 39, wherein the ACP domain is encoded by a nucleotide 
10 sequeuce set forth in any one or more of SEQ ID NO: 23. 25 and 27, or variants thereof 

47. The polynucleotide of claim 39, wherein the A domam is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 29, 31, 33, 35, 37, 39. 41, 43, 45 and 

47. or variants thereof 

48. The polynucleotide of claim 47, wherein the A domain is encoded by a nucleotide 
15 sequence comprising each of the sequences set forth in SEQ ID NO: 29, 3 1, 33, 35, 37, 39, 

41, 43, 45 and 47, or variants thereof 

49. The polynucleotide of claim 39, wherein the PCP domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 49 and 51, or variants thereof 

50. The polynucleotide of claim 39, whereia the C domain is encoded by a nucleotide 
20 sequence set forth in any one or more of SEQ ID NO: 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 

73, 75, 77 and 79, ot variants thereof. 

51. The polynucleotide of claim 50, wherein the C domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 53, 55, 57, 59, 61, 63 
and 65, or variants thereof 

25 52. The polynucleotide of claim 50, wherein the C domain is encoded by a nucleotide 
sequence comprisuig each of the sequences set forth in SEQ ID NO: 67, 69. 71, 73, 75, 77 
and 79, or variants thereof 
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53. The polynucleotide of claim 39, comprising the sequence set forth in any one of SEQ 
ID NO: 1 or 3, or a biologically active fragment thereof at least 18 nucleotides in length, or 
a polynucleotide variant of these. 

54. The polynucleotide of claim 53, wherein the polynucleotide variant has at least 60% 
5 sequence identity to any one of the polynucleotides set forth in SEQ ID NO: 1 or 3. 

55. The polynucleotide of claim 53, wherein the polynucleotide variant is capable of 
hybridising to any one of the polynucleotides identified by SEQ ID NO: 1 or 3 under at 
least low stringency conditions. 

56. The polynucleotide of claim 39, wherein the polynucleotide variant comprises a 
10 nucleotide sequence encoding at least one said domain. 

57. The polynucleotide of claim 56, wherein the nucleotide sequence variant has at least 
60% sequence identity to any one or more of the sequences set forth in SEQ ID NO: 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 
57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77 and 79. 

15 58. The polynucleotide of claim 56, wherem the nucleotide sequence variant is capable of 
hybridising to any one of the sequences identified by SEQ ID NO: 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77 and 79 under at least low stringency conditions. 

59. An isolated polynucleotide comprising a sequence encoding at least biologically active 
20 fragment of the sequence set forth in SEQ ID NO: 83, or a variant or derivative thereof 

60. The polynucleotide of claim 59, comprising the sequence set forth in any one of SEQ 
ID NO: 82 and 84, or a biologically active fragment thereof, or a polynucleotide variant of 
these. 

61. The polynucleotide of claim 59, comprising a contiguous sequence of nucleotides at 
25 least 18 nucleotides in length and contained within the sequence set forth in SEQ ID NO: 

86, or variant thereof 

62. The polynucleotide of claim 59, wherein the polynucleotide variant has at least 60% 
sequence identity to any one of the polynucleotides set forth in SEQ ID NO: 82, 84 and 86. 
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es. The polynucleotide of claim 59, wherein the polynucleotide variant is capable of 
hybridising to any one of the polynucleotides identified by SEQ ID NO: 82, 84 and 86 
under at least low stringency conditions. 

64. The polynucleotide of claim 59, wherem the polynucleotide variant comprises a 
5 nucleotide sequence encoding at least one PPTase sequence motif selected firom SEQ ID 

NO: 89 and 93, or variant thereof 

65. The polynucleotide of claim 64, wherein the polynucleotide variant comprises a 
nucleotide sequence encoding the intervening sequence between the said consensus 
PPTase sequence motifs, said nucleotide sequence comprising the sequence set forth in 

10 SEQ ID NO: 91. 

66. The polynucleotide of claim 59, wherein the polynucleotide variant suitably comprises 
a nucleotide sequence encoding a contiguous sequence of amino acids contained within the 
sequence set forth m SEQ ID NO: 87, or variant thereof 

67. The polynucleotide of claim 66, wherein the contiguous sequence is encoded by the 
1 5 sequence set forth in SEQ ID NO: 86, or nucleotide sequence variant thereof displaying at 

60% identity thereto. 

68. The polynucleotide of claim 64, wherem the PPTase sequence motif is encoded by a 
nucleotide sequence comprising the sequence set forth in any one of SEQ ID NO: 88 and 
92, or nucleotide sequence variant thereof displaying at 60% identity thereto. 

20 69. The polynucleotide of claim 65, wherein the intervening sequence is encoded by the 
nucleotide sequence set forth in SEQ ID NO: 90, or nucleotide sequence variant thereof 
displaying at 60% identity thereto. 

70. The polynucleotide of claim 66, wherein the contiguous sequence is encoded by the 
sequence set forth in SEQ ID NO: 86, or nucleotide sequence variant thereof displaying at 

25 60% capable of hybridising thereto under at least low stringency conditions. 

71. The polynucleotide of claim 64, wherein the PPTase sequence motif is encoded by a 
nucleotide sequence comprismg the sequence set forth in any one of SEQ ID NO: 88 and 
92, or nucleotide sequence variant thereof capable of hybridising thereto under at least low 
stringency conditions. 
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72. The polynucleotide of claim 65, wherein the intervening sequence is encoded by the 
nucleotide sequence set forth in SEQ ID NO: 90, or nucleotide sequence variant thereof 
capable of hybridising thereto under at least low stringency conditions. 

73. An isolated polynucleotide comprising a sequence encoding at least biologically active 
5 fragment of the sequence set forth in SEQ ID NO: 95, or a variant or derivative thereof. 

74. The polynucleotide of claim 73, comprising the sequence set forth in any one of SEQ 
ED NO: 94 and 96, or a biologically active fragment thereof, or a polynucleotide variant of 
these. 

75. The polynucleotide of claim 73, comprising a contiguous sequence of nucleotides 
1 0 contained within the sequence set forth in SEQ ID NO: 1 04, or variant thereof. 

76. The polynucleotide of claim 73, comprising a contiguous sequence of nucleotides 
contained within the sequence set forth in SEQ ID NO: 106, or variant thereof. 

77. The polynucleotide of claim 73, wherein the polynucleotide variant has at least 60% 
sequence identity to any one of the polynucleotides set forth in SEQ ID NO: 94, 96, 104 

15 and 106. 

78. The polynucleotide of claim 73, wherein the polynucleotide variant is capable of 
hybridising to any one of the polynucleotides identified by SEQ ID NO: 94, 96, 104 and 
106 under at least low stringency conditions. 

79. The polynucleotide of claim 73, wherein the polynucleotide variant comprises a 
20 nucleotide sequence encoding a methyltransferase sequence motif selected from any one or 

more of SEQ ID NO: 99, 101 and 103, or variant thereof. 

80. The polynucleotide of claim 79, wherein the methyltransferase sequence motif is 
encoded by a nucleotide sequence comprising the sequence set forth in any one of SEQ ID 
NO: 98, 100 and 102, or nucleotide sequence variant thereof displaying at least 60% 

25 identity thereto, 

81. The polynucleotide of claim 79, wherein the methyltransferase sequence motif is 
encoded by a nucleotide sequence comprising the sequence set forth in any one of SEQ ID 
NO: 98, 100 and 102, or nucleotide sequence variant thereof capable of hybridising thereto 
under at least low stringency conditions. 
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82. An expression vector comprising the polynucleotide of any one of claims 39, 59 or 73, 
wherein the polynucleotide is operably linked to a regulatory polynucleotide. 

83. A host cell containing the expression vector of claim 82. 

84. A multipUcity of cell colonies, constituting a Ubrary of colonies, wherein each colony 
5 of the Ubrary contains an expression vector for the production of the polypeptide of claim 1 

or claim 12. 

85. A method for enhancing the level and/or functional activity of an albicidin, said 
method comprising: 

- introducing into an albicidin-producing host cell (1) an agent that modulates 
10 the expression of a gene encoding at least a portion of the polypeptide of claim 1 or 

variant or derivative thereof, or the level and/or functional activity of an expression 
product of said gene, or (2) a vector from which a polynucleotide encoding at least a 
portion of the polypeptide of claim 1 or variant or derivative thereof can be 
translated; 

15 _ and culturing the host ceU for a time and under conditions sufBcient to 

enhance the level and/or functional activity of said albicidin. 

86. The method of claim 85, fiirther comprising introducing into said host ceU a vector 
from which a PPTase can be translated. 

87. The method of claim 86, wherein the PPTase is selected from EntD or XabA. 

20 88. The method of claim 85, further comprising introducing into said host cell a vector 
from which a methyltransferase can be translated. 

89. The method of claim 86, wherein the methylfransferase is XabC. 

90. An antigen-binding molecule that is immuno-interactive with the polypeptide of claim 
1 or claim 12. 

25 91 . An antigen-binding molecule that is immuno-interactive with the polypeptide of claim 
23. 

92. An antigen-binding molecule that is immuno-interactive with the polypeptide of claun 
31. 
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93. A method of preparing a polynucleotide encoding a modified PKS, comprising using a 
nucleotide sequence encoding the polypeptide of claim 1 or claim 12 as a scaffold and 
modifying the portions of the nucleotide sequence that encode enzymatic activities, either 
by mutagenesis, inactivation, deletion, insertion, or replacement. 

5 94. A method for producing polyketides, comprising expressing the modified albicidin 
PKS encoding nucleotide sequence produced by the method of claim 93 in a suitable host 
cell to thereby produce a polyketide different from that produced by said polypeptide. 
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PlFl 

967 GGCGATGCGC GCACACTGCA GGTCCATCAC GCCACCTCCA GCAGGGTGTC 
CCGCTACGCG CGTGTGACGT CCAGGTAGTG CGGTGGAGGT CGTCCCACAG 
AIRA CQli DMM RBS 
xatA^ I 

1017 ATACACGGCC AGCGGATGCT GCAGGTTTTC CACTGGCAGG GCCACTGGCT 
TATGTGCCGG TCGCCTACGA CGTCCAAAAG GTGACCGTCC CGGTGACCGA 

-35 (Pj^ftfl) -10 (Pxow) 

1067 GTCGTAAGGG AAGCGGTGCC TTGAGC GCCG GTGCGGACAG TATAACGACA 
CAGCATTCCC TTCGCCACGG AACTCGCGGC CACGCCTG TC ATATT GCTGT 

-10 (Pw) 

1117 CGTTCCTTGG CCAAGCGCAC TGTCGGCACG GCCTTGCTGA TGCCGCCCAT 
GCAAGG AACC GGT TCGCGTG ACAGCCGTGC CGGAACGACT ACGGCGGGTA 
-35 (P«L^) 

1167 GTAGCCGCGC GCCTGGATCT CGCGTAGTAG CACCACGCTG GCCGGGATCC 
CATCGGCGCG CGGACCTAGA GCGQAWGMcVQTGa^GCGAe 

FIR 

RBS I ' ^ 

1217 AT CGAGG GCG CGCTTGCCCA ATGCGCTCAT GCAGATAACT CTTGTAGCCG 
TAGCTCCCGC GCGAACGGGT TACGCGAGTA CGTCTATTGA GAACTACGGC 
MP NAL MQIT LVA 
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{i).AL 



TSGSSGESKGILLSH- -GYFRTGDL Xal-XabB(AL) 
TGGTTGVAKGAMLTH--GWMATGDr Hin-LCFA 
TSGSTGTPKAVMLNH--GWFETGDL Bsu-PksJ 
SSGSTGDPKGVMLTH- -GWVKTGDL Bsu-MycA(AL) 
SSGTTGLPKGVMLTH--GWLHTGDX Pcr-ConiL2 
TS GTTGRPKGWS AQ - - GWYRTGDIj Snia-FkbB(AL) 
TSGTTGRPKGWSTQ--GWFRTGDIf Aine-RifA.(AL) 
TS GTTGTPKGVLS TQ - - GWYRTGDL Shy-RapA(AL) 



(ii). KS 

GPSEVINSACSSSLVAL- - VELHGTGTSL- -ALGHLGAAAG Xal-XabB (KS 1) 
GPSnAVDTACSASLTAl--XEAHGTGTVL--NIGHAESAAG Xal-XabB (KS2) 
GPSLFVHTNCSSSIiSAIj- -VEAHGTGTLL- -NLGHLDTVAG Mxa-Tal 
GPAVTVDTACSSSLVAV- -lEAHGTGTKL- -NIGHLFEAAG Bsu-MycA 
GPAVTVDTACSSSIiVAL- -VEAHGTGTRL - -NI GHAQAAAG Ser-EryAl 
GPAMTVDTACSSGLTAL- -VEAHGTGTRL- -NIGHTQAT^G Ser-EryA3 
GPSVLVDTACSGGLTAL- -VECHQTGTQA--NIGHLEGASG Che-PKSl 
GPSLAVDTACSSSLTAI - -LEAHGTGTAL- -NIGHCESAAG Bsu-PksM 
GP S VAVDTACS S S LVAI - - VEAHGTGTLL - - NLGHTEAAAG MtU-PpsA 
GPSLTIDTACSSSLMAL- -VEAHGTGTKV- -NMGHPEPASG Chick-FAS 
GPSXALDTACSSSLLAL- -lEAHGTGTKV- -NMGHPEPASG Rat-FAS 
* * * 

(Active site cysteine) (Active site histidine) 



(iii). KR 

VYWIGGAGGLGEVLSEHLIRTYD.AQLIWIGR Xal-XabB 
VyVISGGTGALARLFVAEIGKRATRATVILVAR Mxa-Tal 
TVLVTGGTG6VGGQI ARWLARRG . APHLLLVSR Ser-EryA 1 
TVLVTGGTGGIGAHLARWLARSG.AEHLVLLGR Ser-EryA3 
S YLLVGGVGGLGSATALAMSTRG . ARHLLLINR Che-PKS 1 
SYIITGGLGGLGLFFASKLAAAG . CGRIVLTAR Mtu-MAS 
SYIITGGLGGFGLELAQWLIERG . AQKLVLTSR Chick-FAS 
S YI ITGGLGGFGLELARWLVLRG . AQRLVLTSR Rat-FAS 



(iv). ACP 

CELALDSLQCVR Xal-XabB(ACPl) 
EYYGVDSIVAIE Xal-XabB (ACP2) 
ESYGVDSIVIIE Xal-XabB (ACP3) 
IGFGLDSIMLTQ Bsu-MycA 
ERYGIDSIIITQ Mxa-Tal 
AELGVDSLSALE Ser-EryAl 
QDYGIDSLVAVE Che-PKSl 
lEYGLDSLGMLE Mtu-MAS 
ADLGLDSLMGVE Chick-FAS 
ADLGLDSLMGVE Rat-FAS 

* (Active site serine) 
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AL ACP KS ACP 



C. Yersinia pestisEMWPl (3163 aa) 











KS AT 




KR ACP 











D. M xanthiis Tal (2392 aa) 



~C^E=/fc==rReP-B KS KR ACP KS 



E. B. subtilis PksorfX6 (4447 aa) 
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DH 


KR 


ACP 


KS 


KR 
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10S4 TTCCCGCCCGAATAGGCGCAGGAAGCCAATAAGTATGGCAGCGCCCTTGACCiUVTGACA^ 

1135 GCTCTGCTCCX3CGTCGTCCATCGCCATTCCGCCCCTCCCCGACCCCAAGCATCGACCAAAG(^ 

1216 CGCGACTCTGCGACACTAGCGCAATGTTflTCGTCGACATTGACGCra 

-10 RBS 

M P N A V S 

1297 ACCGATGCAGGGCGCGCGGGGACTCCCGCAGCCGCAAGCGATGAACCCAGGGTTGCCGAGCGTCGGCGGC^^ 

PMQGARGLPQPQ AMNPGLPSVGGLSAG32 

1378 CCAGCCATTGCAGTTGTCGTTAGCACCGGAACTGCAGGCAGCCGCGCGCAGTGCCCACCGCCATCTGCTCGACGACGGCAC 

QPLQLSLAPELQAAARSAHRHLLDDGT59 

14 S 9 GGCGCTTTACCTGCrGGCGXTCGATACCGCGCAATTCGACCCGGGGGCTTTCGOSGCAATG 

ALYLLAFDTAQPDPGAFAAMAIARPDSSC 

1540 CATCGCCCGCAGCGTGCGCAAGCGTCAGGCCGAGTTCCTGTTCGGCCGTCTGGCCGCGCGACTGGCGCTGCAAGAGGTGC 

IARSVRKRQAEPI.FGRLAARLALQEVL113 

1621 GGGACCTGCGCAAGCGCAGGCAGATATTGCAATCGGCGCGACGCGCGCGCCCTGCTGGCCTGCCGGCAGCCTGGGC^^ 

GPAQAQADIAIGATRAPCWPAGSLGSI140 

1702 TTCCCATTGCGAGGACTACGCGGCCGCCATCGCCATGGCGGCCGGCACCCGCCACGGCGTGGG^^ 

SHCEDYAAAIAMAAGTRH G tV G I P] L E R P 167 

1783 AATCACACCCGCGGCGCGCGCGGCGTTGCTGAGCATCGCAATCGATGCCGACGAAGCCGCTCGTCTGGCAAAGGCGGOVGA 

ITPAARAALLSIAIDADEAARLAKAAD 194 



1854 CGCGCyVGTGGCCGCAAGACCTGCTGCTGACCGCACTATTTT 

AQWPQDLLLTAL |F~g A |K E S) LF@AAYSAV221 

1945 CGGACGCTACTTCGACTTCAGCGCGGCACGCCTGTGCGGCATCGACCTGGCACGGCAATGCCTGCA^ 

GRyFDFSAARLCGIDLARQCLHLRLTE248 

2026 GACACTCTGCGCGCAATTCGTGGCCGGGCAAGTGTGCGAGGTCGGCTTCGCGCGCCTACCACCGGACCTGGTGCTCACCCA 

TLCAQFVAGQVCEVGFARL PPDLVLTH 27S 

2107 C'TT\^nr^^r.n'rni.r,rTirarar.ACAGTCGM^CCCGCC^ 

YAW* 2"^^ 

2188 AAG CTCTC CCCGCAGCCGCACTCGGCGGTGGCATTCGGATTGCGGAACACGAAGGTCTCACCCA^ 

2269 TCGATTTCGGTGCCATCGACCAACTGCAGACTGGTOGCATCGACATAAATCCGCACTCCGTCCTGCTCGAACACCGCATCG 
2350 TCCGCGCGTGCCTCGTGCGCCAGATCGGTGACATGGCCCCAACCGGAACAGCCTGTGCGTACCACCCCGAAACGTAGACCC 
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CJal RBS (MF) 



Xfl*C HDSALPTSAFTPDLPyTT 
V N 30 ^ 

occtactatc«jictxk:c(Xagtcaaggcxxjcga 

"^R TAAVKAAIBLGLPDVVCQOGRTPAAXAEACOASPR 

G 60 • 

Clal 

ATTCGCATCCTTOCTATTACCTACTATCGATC^^ 

TT r\ cYVLVSIOPLRRHCGLPylDRNMAMVLDRSSPGYL 
C 100 

GCCAGCATCAACTTCCTGCTCl^CrTACATCATGACCCCCT^^ 

ACTCG 4fiS^ pi^i.spyjHSAFTDLTAVVRTGKIBLAQDOVVAPDH 
p 140 

CAGTtaGTXKAATnPCACGCGCGATXgKZACCGATMTC^^ 

pARAMAPMHALPSALIANMVSLPADRPIR [V ^ P V 
^ I"" «oti£ 

»CCCCCTCnTOGCATCGCCTlXXXMCAGCCCrr^^ 

"^FOIAFAQRFRQABVSPLDHDNVLDVAREMAQAAKV 

(IR) Hindi 
CGAGCCKGTrrcCTGCCXXKCflACGCATT^^ 
TGGC^ 625 

BARFLPONAPOLDYG ts G Y D V I L ij TNFLHHFDEVOGERI 
L A 260 

EcoRZ Hotie 11 

AAGACGCGCGATCCGCTGAAOTACGACCCCATGtntWTCACTlT^ 
CCACC 945 

K T R 0 A [I. N D P G M V FBPIADEERSSPPI.AATPSMHMLG 

Hotlf III 

CCCGCGGCXMAGTOTACACCTATAGCGATCrGGAAACGATCTTTCGGCATGCCGGC^ 
GCAAG 1065 

GESYTYSDLERMFRHAOPGHVELiCSlPPALI.KVVVSR 
K 340 

ACXKXTCCCATAATGATCGAATCGGCGACATCCCCTGTCGCGAAAACCGAGCGCA 

ccTGc lies 

TCCCGGGGTATTACT tA3R) 

RAP* 

343 

TCGGTATACGCATGATCGAGATCGGCTCGGACThTCTCGTCTCCTGCATGT^ 
CGAGG 13 05 

CTACCGGOlGCATCCaWCCKXATXnXKrGTCCCGGCC^^ 
gCGGC 1425 

CGCTGCACATAGGGGCCTTGACCCJ^TATGGCAGATGCGCATCTAT^ 
1515 
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Xal-XabC 
Sgl-TcmO 
Sgl-TcmN 
Smy-MdmC 
Mxa-SafC 
Ser-EryG 
Spe-DauK 
Sal-DmpM 
Shy-RapM 
Sav-AveD 



174 VLDVAAGHG 
173 FVDLGGARG 
331 lADLGGGDG 
64 VLEIGTFTG 
63 TLEVGVFTG 
85 VLDVGFGLG 
183 VLDVGGGKG 
208 WDIGGADG 
106 VLEVGCGMG 
71 VLDVGCGSG 

Motif I 



23 6 SGYDVILL 
234 PRADVFIV 
393 TGYDAYLF 
135 GAFDIVFV 
134 GTFDLAFI 
149 ETFDRVTS 
254 RKADAIIL 
269 GGGDLyVL 
155 VQGDAEEL 
124 GSFDAAWA 

Motif II 



267 ALNDDGMVIT 
263 ALTPGGAVLV 
423 IGDDDARLLI 
159 LVRPGGLVAI 
15 8 LVRPGGLIIL 
178 VLKPGGVLAI 
273 ALEPGGRILI 
2 98 AMP7VHARLLV 
194 ALRRGGALSH 
151 VLRPGGRLAV 

Motif III 
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